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ABSTRACT 


This monograph presents integrated modeling and controller design methods 
for flexible structures. The controllers, or compensators, developed are opti- 
mal in the 1 inear-quadratlc-Gaussian sense. The performance objectives, sensor 
and actuator locations and external disturbances influence both the construction 
of the model and the design of the finite dimensional compensator. The modeling 
and controller design procedures are carried out in parallel to ensure com- 
patibility of these two aspects of the design problem. Model reduction tech- 
niques are introduced to keep both the model order and the controller order as 
small as possible. 

A linear distributed, or infinite dimensional, model is the theoretical 
basis for most of the text, but finite dimensional models arising from both 
lumped-mass and finite element approximations also play an important role. A 
central purpose of the approach here Is to approximate an optimal infinite 
dimensional controller with an implementable finite dimensional compensator. 

Both convergence theory and numerical approximation methods are given. Simple 
examples are used to illustrate the theory. 
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1. Introduction 


Recent years have seen increasing research in active control of flexible 
structures. The primary motivation for this research is control of large 
flexible aerospace structures, which are becoming larger and more flexible at 
the same time that their performance requirements are becoming more stringent. 
For example, in tracking and other applications, satellites with large antennae, 
solar collectors and other flexible components must perform fast slew maneuvers 
while maintaining tight control over the vibrations of their flexible elements. 
Both of these conflicting objectives can be achieved only with a sophisticated 
controller. There are applications also to control of robotic manipulators with 
flexible links, and possibly to stabilization of large civil engineering struc- 
tures such as long bridges and tall buildings. 

A fundamental question that often arises with regard to designing a 
controller for a flexible structure is whether a finite dimensional model is 
sufficient as a basis for a controller that will produce the required perfor- 
mance, or is a distributed model necessary? The philosophy of this text is 
that, for almost any structure, a near optimal controller can be based on some 
finite dimensional approximate model but that, for large flexible space struc- 
tures, the necessary order of the approximate model can be determined only in 
the controller design process, with some reference to the distributed model of 
the structure. 

The main topic of this monograph is the design of LQG compensators based on 
finite dimensional approximations of distributed models of flexible structures. 

A primary objective of the approximation methods in subsequent chapters is con- 
vergence criteria that indicate the appropriate order of a finite dimensional 
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compensator for an infinite dimensional flexible structure. The basic idea in 
determination of the compensator order and gains is to approximate an ideal 
infinite dimensional compensator with an implementable finite dimensional com- 
pensator. 

This monograph deals with 1 inear-quadratlc-Gaussian (LOG) compensators. The 
LQG optimal control problem for distributed, or infinite dimensional, systems is 
a generalization to Hilbert space of the now classical LOG problem for finite 
dimensional systems. The solution to the infinite dimensional problem yields an 
infinite dimensional state-estimator-based compensator, which is optimal in the 
context of this monograph. By a separation principle [Bal, CP2], the problem 
reduces to a deterministic linear-quadratic optimal control problem and an opti- 
mal estimation, or filtering, problem with gaussian white noise. In infinite 
dimensions, the control system dynamics are represented by a semigroup of 
bounded linear operators instead of the matrix exponential operators in finite 
dimensions, and the plant noise process may be an infinite dimensional random 
process. The solutions to both the control and filtering problems involve 
Riccati operator equations, which are generalizations of the Riccati matrix 
equations in the finite dimensional case. Current results on the infinite 
dimensional LQG problem are most complete for problems where the input and 
measurement operators are bounded, as this monograph requires throughout. This 
boundedness also permits the strongest approximation results. For related 
control problems with unbounded input and measurement, see [Cul, CS1, LT1, LT2, 
LT3]. 

Our primary objective is to approximate the optimal infinite dimensional LQG 
compensator for a distributed model of a flexible structure with finite dimen- 
sional compensators based on approximations to the structure, and to have these 
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finite dimensional compensators produce near optimal performance of the closed- 
loop system. We discuss how the gains that determine the finite dimensional 
compensators converge to the gains that determine the infinite dimensional com- 
pensator, and we examine the sense in which the finite dimensional compensators 
converge to the infinite dimensional compensator. With this analysis, we can 
predict the performance of the closed-loop system consisting of the distributed 
plant and a finite dimensional compensator that approximates the infinite dimen- 
sional compensator. 

Our design philosophy is to let the convergence of the finite dimensional 
compensators indicate the order of the compensator that is required to produce 
the desired performance of the structure. The two main factors that govern rate 
of convergence are the desired performance (e.g., fast response) and the struc- 
tural damping. We should note that any one of our compensators whose order is 
not sufficient to approximate the infinite dimensional compensator closely will 
not in general be the optimal compensator of that fixed order; i.e., the optimal 
fixed-order compensator that would be constructed with the design philosophy in 
[BH1, BH2] . But as we increase the order of approximation to obtain con- 
vergence, our finite dimensional compensators become essentially identical to 
the compensator that is optimal over compensators of all orders. 

An important question, of course, is how large a finite dimensional compen- 
sator we must use to approximate the infinite dimensional compensator. In [GM1, 
GM2, GM3, MG1], we have found that our complete design strategy yields compen- 
sators of reasonable size for distributed models of complex space structures. 
This strategy in general requires two steps to obtain an implementable compen- 
sator that is essentially identical to the optimal infinite dimensional compen- 
sator: the first step determines the optimal compensator by letting the finite 
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dimensional compensators converge to it; the second step reduces, if possible, 
the order of a large (converged) approximation to the optimal compensator. The 
first step, which is the one involving control theory and approximation theory 
for distributed systems, is the subject of Chapters 7-9. For the second step, a 
simple modal truncation of the large compensator sometimes is sufficient, but 
there are more sophisticated methods in finite dimensional control theory for 
order reduction. For example [GM2, MG1], we have found balanced realizations, 
discussed in Chapter 5, to work well for reducing large compensators. 

The approximation theory in this monograph follows from the application of 
approximation results in [BK1, G13, G14] to a sequence of finite dimensional 
optimal LQG problems based on a Ritz-Galerkin approximation of the flexible 
structure. For the optimal linear-quadratic control problem, the approximation 
theory here is a substantial improvement over that in [Gil] because here we 
allow rigid-body modes, more general structural damping (Including damping in 
the boundary), and much more general finite element approximations. These 
generalizations are necessary to accommodate common features of complex space 
structures and the most useful finite element schemes. For example, we write 
the equations for constructing the approximating control and estimator gains and 
finite dimensional compensators in terms of matrices that are built directly 
from typical mass, stiffness and damping matrices for flexible structures, along 
with actuator influence matrices and measurement matrices. This means that the 
numerical methods in this text for compensator design can be used for almost any 
finite element model of a flexible structure, without reference to the infinite 
dimensional theory that establishes the validity of the numerical methods for a 
distributed model of the structure. 

For the estimator problem, this monograph presents rigorous approximation 
theory that has evolved from less complete results in previous research [GM1, 
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GM2, GM3, MG1]. As In the finite dimensional case, the infinite dimensional 
optimal estimation problem is the dual of the infinite dimensional optimal 
control problem, and the solutions to both problems have the same structure. 
Because we exploit this duality to obtain the approximation theory for the esti- 
mation problem from the approximation theory for the optimal control problem, 
the analysis in this monograph is almost entirely deterministic. We discuss the 
stochastic Interpretation of the estimation problem and the approximating state 
estimators briefly, but we are concerned mainly with deterministic questions 
about the structure and convergence of approximations to an infinite dimensional 
compensator and the performance — especially stability -- of the closed-loop 
systems produced by the approximating compensators. 

Now we will outline the technical contents of the monograph. Chapter 2 
discusses a finite dimensional model for the forced linear vibrations of a 
flexible structure. Although the main goal is to design compensators for infi- 
nite dimensional flexible structures, the finite dimensional approximations upon 
which implementable compensators are based amount to finite dimensional models 
of flexible structures. These approximate models have the form of the finite 
dimensional model in Chapter 2. Important features of flexible structures, like 
natural modes and damping, are discussed in Chapter 2. Extensions of these 
notions for flexible structures with gyroscopic forces are also discussed. 
Gyroscopic terms appear whenever the structure rotates or contains rotating ele- 
ments. 

Chapter 3 defines the abstract distributed model of an infinite dimensional 
flexible structure and the energy spaces to be used in most of the subsequent 
chapters. We assume a finite number of actuators, since this is the case in all 
applications, and we assume that the actuator influence operator is bounded. 
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Chapter 3 also establishes certain mathematical properties of the open-loop 
system that are useful in control and approximation. 

We develop the approximation of the distributed model of the structure in 
Chapter 4 and prove convergence of the approximating open-loop systems. The 
approximation scheme is essentially a Ritz-Galerkin method that includes modal, 
including component modal, approximations and most finite element approximations 
of flexible structures. For convergence, we use the Trotter-Kato semigroup 
approximation theorem, which was used in optimal open-loop control of hereditary 
systems in [BB1] and has been used in optimal feedback control of hereditary, 
hyperbolic and parabolic systems in [BK1, Gil, Gi3] and other papers. The usual 
way to invoke Trotter-Kato is to prove that the resolvents of the approximating 
semigroup generators converge strongly. To prove this, we introduce an inner 
product that involves both the strain-energy inner product and the damping func- 
tional, and show that the resolvent of each finite dimensional semigroup genera- 
tor is the projection, with respect to this special inner product, of the 
resolvent of the original semigroup generator onto the approximation subspace. 
The idea works as well for the adjoints of the resolvents, and when the open- 
loop semigroup generator has compact resolvent, it follows from our projection 
that the approximating resolvent operators converge in norm. 

The speed of convergence to the optimal feedback law and optimal estimator 
is affected by the manner in which the model order is Increased. It is useful 
to have a method of identifying important modes of the structure so they can be 
added to the model first. This issue is considered in Chapter 5. Moore's 
Balanced Realization Theory is introduced as a method for ordering the impor- 
tance of states of a structural model. The approach takes into account input 
and output coupling as well as frequency and damping. An efficient approxima- 
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1 1 on scheme Is developed for structures which are lightly damped. Balanced 
Realization Theory may also be used to reduce the order of a general matrix 
transfer function, and in this context it provides a tool for reducing the order 
of the compensator. 

Chapter 6 discusses the LQG optimal control problem for the distributed 
model of the structure and establishes some estimates involving bounds on solu- 
tions to infinite dimensional Riccati equations and open-loop and closed-loop 
decay rates. We need these estimates for the subsequent approximation theory. 

To get the approximation theory for the estimation problem, we have to give cer- 
tain results on the control problem in a more general form than would be 
necessary were we interested only in the control problem for flexible struc- 
tures. Therefore, in Chapter 6, we first give some generic results applicable 
to the LQG problem for a variety of distributed systems and then apply the 
generic results to the control of flexible structures. 

For closing the loop on the control system, we assume a finite number of 
bounded linear measurements and construct the optimal state estimator, which is 
infinite dimensional in general. The gains for this estimator are obtained from 
the solution to an infinite dimensional Riccati equation that has the same form 
as the infinite dimensional Riccati equation in the control problem. 

Since the approximation issues that this book treats are fundamentally 
deterministic, we make the book self-contained by defining the infinite dimen- 
sional estimator as an observer, although the only justification for calling 
this estimator and the corresponding compensator optimal is their interpretation 
in the context of stochastic estimation and control. We discuss the stochastic 
interpretation but do not use it. We say estimator and observer interchangeably 
to emphasize the deterministic definition of the estimator here. 
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Because we assume a finite number of actuators and a bounded input operator, 
the optimal feedback control law consists of a finite number of bounded linear 
functionals on the state space, which is a Hilbert space. This means that the 
feedback law can be represented in terms of a finite number of vectors, which we 
call functional control gains, whose inner products with the generalized dis- 
placement and velocity vectors define the control law. For any finite-rank, 
bounded linear feedback law for a control system on a Hilbert space, the 
existence of such gains is obvious and well known. A functional control gain 
for a flexible structure will have one or more distributed components, or ker- 
nels, corresponding to each distributed component of the structure and scalar 
components corresponding to each rigid component of the structure. 

Analogous to the functional control gains are functional estimator gains 
corresponding to the finite number of sensor measurements. The functional gains 
play a prominent role in our analysis. They give a concrete representation of 
the infinite dimensional compensator and provide a criterion for convergence of 
the approximating finite dimensional compensators. 

The optimal LQG compensator is Infinite dimensional in general. The 
transfer function of this compensator is irrational, but it is still an m(number 
of actuators) x p(number of sensors) matrix function of a complex variable, as 
in finite dimensional control theory. The optimal closed-loop system consists 
of the distributed model of the structure controlled by the optimal compensator. 

We develop the approximation scheme for the LQG control problem in Chapter 
7. We define a sequence of finite dimensional LQG problems, whose solutions 
approximate the solution to the infinite dimensional problem of Chapter 6. The 
solution to each finite dimensional problem is based on the solutions to two 
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Riccatl matrix equations, and we give formulas for using these solutions to com- 
pute approximations to the functional control and estimator gains as linear com- 
binations of the basis vectors. The approximating functional gains indicate how 
closely the finite dimensional compensators approximate the infinite dimensional 
compensator. The brief discussion in Section 7.6 is the only place in the 
monograph where stochastic estimation theory is necessary, and none of the 
analysis in the rest of the monograph depends on this discussion. 

Chapter 7 presents the finite dimensional equations to be used in designing 
the finite dimensional compensators. This chapter contains no convergence 
analysis. The equations in Chapter 7 can be used for numerical design of the 
finite dimensional compensators, without worrying about the meaning of these 
compensators with respect to the infinite dimensional control problem. 

Chapter 8 contains convergence theory that gives conditions under which the 
approximating compensators in Chapter 7 converge to the infinite dimensional 
compensator in Chapter 6. This theory also describes the sense in which the 
finite dimensional compensators converge. The compensator convergence is 
discussed in terms of both the convergence of the approximating functional gains 
and the convergence of the transfer functions of the finite dimensional compen- 
sators. 

In Chapter 9, we compute approximating finite dimensional compensators for a 
compound structure that consists of an Euler-Bernoul 1 i beam attached on one end 
to a rotating rigid hub and on the other end to a lumped mass. We emphasize the 
fact that we do not solve, or even write down, the coupled partial and ordinary 
differential equations of motion. For both the definition and numerical solu- 
tion of the problem, only the kinetic and strain energy functionals and a dissi- 
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patlon functional for the damping are required. We show the approximating 
functional control gains obtained by using a standard finite element approxima- 
tion of the beam, and we discuss the effect on convergence of structural damping 
and of the ratio of state weighting to control weighting in the performance 
index. As suggested by a theorem in Chapter 8, the functional gains do not con- 
verge when no structural damping is modeled. 

We study the convergence of the finite dimensional compensators by examining 
convergence of the approximating functional gains and the frequency responses of 
the finite dimensional compensators. Also, we compute the eigenvalues of the 
closed-loop system consisting of a finite dimensional compensator and larger- 
order model of the structure. These eigenvalues indicate how many modes are 
controlled how much by the infinite dimensional compensator and by the finite 
dimensional compensators that are essentially identical to the infinite dimen- 
sional compensator as far as the input/output map. The functional gains and the 
compensator frequency response indicate the order of a finite dimensional com- 
pensator necessary to approximate the infinite dimensional compensator closely, 
and the closed-loop eigenvalues confirm this order. 

Chapter 10 discusses several further issues important for implementation of 
compensators designed by the methods in this book, and suggests further research 
to make the ideas presented here useful for application to other control 
problems for large flexible structures. 
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2. A Finite Dimensional Model of a Flexible Structure 
2.1 Forced Linear Vibrations 

The forced vibrations of a large class of flexible structures may be 
described approximately by the second-order finite dimensional differential 
equation 

(2.1.1) Mx + GX + DX + Kx + Fx = f Q 

where the generalized displacement x(t) and the generalized forcing function 
f q ( t ) are n-vectors, the mass matrix M, damping matrix D and stiffness matrix K 
are real symmetric n x n matrices, and the gyroscopic matrix G and circulatory 
matrix F are real skew-symmetric n x n matrices. The mass matrix is positive 
definite, and the damping matrix is nonnegative. In most cases the stiffness 
matrix is also nonnegative, but some gyroscopic systems can have stiffness 
matrices which are indefinite. Any zero eigenvalue of < corresponds to a rigid- 
body mode. The gyroscopic matrix G results from rotating components, and the 
circulatory force matrix F may result from either follower forces or damping 
described in rotating reference frames. The terms M x, G X and K x are conser- 
vative terms in (2.1.1) while D X and F x are nonconservative. The forcing 
function fp is often expressed as 

(2.1.2) f Q = B q u (t) 

In this expression, B Q is a real n x m input distribution matrix and u(t) is an 
m x 1 input vector which could represent feedback control forces, external 
forces or both. In the present development, u will represent control forces or 
moments. The input distribution matrix depends on where the control or external 
forces act on the system, and it determines the relative effect of the inputs on 
the various coordinates. 
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The first-order form of (2.1.1) is 


(2.1.3) t = Az + f 

r T T ,T 

where the 2n x 2n state vector z is z(t) = [x k J and 




0 I 


0 

(2.1.4) 

A = 

. -H -1 (K+F) -M _1 (G+D) 

, f = 

f 0 


Linear measurements of the state vector have the form 
(2.1.5) y=Cz+CgU = [Cp C^J ( ^ ) + Cg u 

where C, Cg, Cp and C y are real matrices of appropriate dimensions. 

2.2 Natural Modes for Nongyroscoplc Systems 

Natural modes and natural frequencies may be defined by considering free 
vibrations of the conservative part of (2.1.1), i.e., 

(2.2.1) Mx+GX+Kx=0 

We first examine the case where G is zero and K is nonnegative. In Laplace 
transform notation, (2.2.1) reduces to: 

(2.2.2) [s 2 M + <] x = 0 

Equation (2.2.2) defines an eigenvalue problem which may be solved for s and x. 
Since M and K are real symmetric matrices with M positive definite and K non- 
negative, the solutions for s consist of 2n purely imaginary complex conjugates, 
s p = ±jWp, r=l,2,...,n where the w r 's are called the natural frequencies . The 
corresponding eigenvectors x p are real and are called the natural mode shapes. 
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The mode shapes are complete in R n and mode shapes corresponding to distinct 
frequencies are orthogonal with respect to both M and K. 

If U is the matrix whose columns are the natural mode shapes normalized with 
respect to the mass matrix M, then substituting 

(2.2.3) x(s) = U a(s) 

in (2.2.2) and multiplying the equation on the left by U T yields 

(2.2.4) s 2 a + n 2 a = U T f Q 
where 

0 

•J. 

The components of the n-vector a(t) are called the modal coordinates of the 
structure, and the n scalar equations in (2.2.4) are called the modal equations 
of motion. 

2.3 Modal Damping 

If a damping matrix D is added to (2.2.2), then (2.2.4) becomes 
(2.3.1) s 2 a + s U T DU a + fi 2 a = U T f Q 

The damping represented by the matrix D is called modal damping if U^DU is 
diagonal. In this case, the damping does not couple the modes of undamped free 


(2.2.5) 


Q = U KU = 


w. 


w. 
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vibration and the eigenvectors of the damped structure are the same as the 
eigenvectors of the undamped structure. The damping matrix U T DU is diagonal if 
and only if M _1 D commutes with M -1 K, or equivalently, if and only if M -1 D and 
M _1 K have the same eigenvectors. In particular, U T DU is diagonal if M -1 D is 
equal to a convergent power series in M _1 K or if M _1 D is equal to a linear com- 
bination of fractional powers of M _1 K. Although an assumption of modal damping 
greatly simplifies the analysis of vibrating structures, it is not always easy 
to reconcile this assumption with physical reality. (See Section 2.5.) In the 
present discussion, we are thinking primarily of the damping matrix D as 
resulting from some material damping model for the flexible components of the 
structure and/or models of joint damping or passive dampers attached to the 
structure. In practice it is common to assume that the matrix U T DU is diagonal 
and the values of modal damping ratios are estimated from experimental data. 

The only justification for this questionable assumption is that for sufficiently 
light damping, the eigenvectors of the finite dimensional model approach the 
natural mode shapes from (2.2.2). Hence, whatever the damping matrix D, as long 
as it Is sufficiently small, a damping matrix D for which U T DU is diagonal will 
produce both eigenvalues and eigenvectors for the structure model that are close 
to those produced by the correct D. 

2.4 Gyroscopic Systems 

If G is not zero in (2.2.1), then the procedure for finding natural frequen- 
cies and natural modes is conceptually and computationally more complicated. 
Meirovitch [Mel] has suggested a procedure which casts the equations in a 
form analogous to (2.2.2) where the coefficient matrices are symmetric. Once in 
this form, it is possible to determine the natural frequencies and natural modes 
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as above (with some differences in interpretation). The procedure depends on K 
being positive definite. If K is only positive semi-definite the procedure can 
still be applied, but first the rigid body modes must be removed from (2.2.1). 
The system which remains is then treated as follows. First, rewrite (2.2.1) in 
an augmented form which has a symmetric and a skew symmetric coefficient matrix: 


(2.4.1) 


M 0 ( M + 
0 K X ' + 


-K 0 v x ' 1 0 ' 


T T T 

Take the Laplace transform of (2.4.1) with (X x ) 


(2.4.2) 


M 0 
0 K 


f\ + 


G K 
-K 0 


■(X) 


replaced by q. 


The eigenvalue problem defined by (2.4.2) may be solved for s and q. The solu- 
tions for s consist of 2n purely imaginary complex conjugates, s f = ±jw p , 
r«l,2,...,n. As before, the w r 's are called the natural frequencies. The 
corresponding solutions for q also occur in complex conjugate pairs and have the 
form q p = v p + j w p , r=l,2,...,n. Setting s p = jw p and q p = v p + j w p in 
(2.4.2) and equating the real and imaginary parts to zero separately yields two 
equations for v r and w r . 

0 

. 0 . 


(2.4.3) 




MO' 


G K 


< 

1 

+ 


.OK. 

1 

. -K 0 . 


w. 


and 


(2.4.4) 


M 

0 

w„ + 

G 

K ' 

v„ = 

0 

0 

K 

r 

-K 

0 

r 

0 


These equations may be rearranged in the form: 
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M 

0 


(K-GM _1 G) 

-GM _1 K 


0 

(2.4.5) 

to 2 

r 

0 

K 

v r - 

. KM" 1 

KM _1 K . 

v r = 

0 



M 

0 


(K-GM -1 G) 

-GM -1 K 


0 

(2.4.6) 

w 2 

r 

0 

K 

w r " 

. KM' 1 

KM -1 K 

w p = 

0 


The appeal of (2.4.5) and (2.4.6) is that in each equation, the left hand matrix 
is symmetric and positive definite and the right hand matrix is symmetric as in 
the case where G is zero (see Eq. (2.2.2)). These features permit the use of 
simpler numerical algorithms, and the eigenvalue analysis proceeds as in the 
nongyroscopic case. There are, however, some differences in the interpretation 
of the eigenvectors since these vectors now include velocities as well as posi- 
tion coordinates. These eigenvectors are not in general referred to as natural 
modes. Refs. [Mel, Me2, Me3] discuss the solution and Interpretation of (2.4.5) 
and (2.4.6) at some length. 


It will also be noted that Eqs. (2.4.5) and (2.4.6) are identical. This 
does not mean that v p and w p are identical. From (2.4.3) and (2.4.4) one can 
show that v p and w p must satisfy the equation 


(2.4.7) w p = 


w I 
r 


-to -1 1 

r 


Thus only one eigenvalue problem must be solved to obtain both w p and v p . If 
(2.4.5) is used to obtain v p , then (2.4.7) can be used to calculate w p . The 
w p obtained in this way will satisfy (2.4.6). 
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2.5 A Lumped Mass Example 


As noted in section 2.3, there may be some difficulty in justifying the 
assumption of modal damping. This will now be illustrated by means of the 
simple example shown in Figure 2.1. This structure consists of three masses, 
two springs and three dampers. The three degrees of freedom include one rigid- 
body mode and two elastic modes. It is instructive to consider the implications 
of assuming modal damping with the same damping ratio in each mode. Because we 
want the damping in each mode to be proportional to the undamped frequency of 
that mode, there should be no damping in the rigid-body mode, so there are no 
dampers attached to the ground. 



Figure 2.1. Lumped Mass Example 

For the coordinates Xj, x 2 and x^ shown in Figure 2.1, the mass stiffness 
and damping matrices are 




m l 

0 

0 


k -k 

0 

(2.5.1) 

M = 

0 


0 

, K * 

-k 2k 

-k 



0 

0 

m 3 


0 -k 

k 


and 
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(2.5.2) 


D = 


( c l + c 2 ) -Cj -c 3 

-Ci (c 1+ c 2 ) -c 2 

" c 3 " c 2 ( C 2 +C 3) 


The damping coefficients c^ c 2 and c 3 are nonnegative. Suppose that we have 
the (dimensionless) mass and spring constants 


= 0.1 m 2 = 1.0 m 3 =0.1 k = 1.0. 

For the matrix U T DU in the modal equation (2.3.1) to be equal to, say, .02 times 
the square root of the matrix on the right hand side of (2.2.5) with n = 3 (i.e., 
for one percent critical damping in each mode) the damping matrix D must be 


(2.5.3) 


D = 


0.010379 

-0.014434 

0.004055 


-0.014434 

0.028868 

- 0.014434 


0.004055 

-0.014434 

0.010379 


No combination of the damping coefficients in Eq. (2.5.2) produces this damping 
matrix. 


With more elaborate combinations of pulleys and dampers, infinitely many 
damping configurations are possible, so it may be possible to realize the 
damping matrix in (2.5.3) by some arrangement of dampers. But even if a reali- 
zation of the required damping exists, it may be a very unlikely physical con- 
figuration. The difficulty of physically realizing constant-damping-ratio modal 
damping in this example suggests that assuming a form, particularly a diagonal 
form, for the damping matrix in the modal equations of motion, without con- 
sidering the physical sources of the damping, can produce improbable damping 
models. 
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3. An Infinite Dimensional Model 

3.1 Infinite Dimensional Equation of Motion 

Now we will describe the abstract model that we will use throughout the book 
to represent an infinite dimensional flexible structure. The generalized 
displacement vector x(t) is in a real Hilbert space H, the control vector u(t) 
is in R m for some finite m and x(t) is a mild solution to 

(3.1.1) x(t) + D 0 *(t) + A Q x(t) = BgU(t) , t > 0. 

In this equation of motion, the linear stiffness operator A Q is densely defined 
and selfadjoint with compact resolvent and at most a finite number of negative 
eigenvalues. The stiffness operators for structures involving beams, plates and 
membranes will satisfy the hypotheses here on Ag. Any nonpositive eigenvalues 
of Ag will represent rigid-body modes. For now, we will assume that the damping 
operator D Q is symmetric, nonnegative and bounded relative to A Q ; this includes 
linear material damping in continuous structural components and linear damping 
from a viscous fluid surrounding the structure (not likely in space applica- 
tions). In Section 3.4, we will discuss a more general type of damping that we 
will allow from then on, except where otherwise noted. The input operator B Q is 
a linear operator from R m to H, and hence bounded (so that this model does not 
include boundary control). 

For the distributed model of a flexible structure, we will not discuss a 
gyroscopic operator corresponding to the matrix G in Chapter 2. It is straight- 
forward to add a bounded skew selfadjoint operator to the damping operator Dg 
and generalize our infinite dimensional system theory and approximation theory 
appropriately, but we do not want to complicate the exposition of this and sub- 
sequent chapters by carrying this extra detail along. 
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Remark 3.1. Our analysis includes the system 


(3.1.1') M 0 x(t) + D 0 *(t) + A 0 x(t) = B 0 u(t), t > 0, 

where the mass operator Mg is a selfadjoint, bounded and coercive linear opera- 
tor on a real Hilbert space Hg. The operators Ag, Bg and Og in (3.1.1* ) have 
the same properties with respect to Hg that the corresponding operators in 

(3.1.1) have with respect to H. To include (3. 1 . 1 ' ) in our analysis, we need 

only take H to be Hg with the norm-equivalent inner product = <Mq*,*>Hq, 

and multiply (3. 1 . 1 ' ) on the left by Mg 1 . In H, the operator M^Ag is self- 
adjoint with compact resolvent, and M^Dg is symmetric and nonnegative. With no 
loss of generality, then, we will refer henceforth only to (3.1.1) and assume 
that the H-inner product accounts for the mass distribution. VVV 

By natural modes of a structure represented by (3.1.1), we will mean the 
eigenvectors of the eigenvalue problem 

(3.1.2) ^j^*j = ^0 ^ j * 

From our hypotheses on Ag, it follows that these eigenvalues form an infinitely 
increasing sequence of real numbers, of which all but a finite number are posi- 
tive. Also, the corresponding eigenvectors are complete in H and satisfy 

(3.1.3) <d> 1 , = <A 0 0 j > H * 0, i * j. 

(For these standard properties of the eigenvalue problem, see [Bal, Ka2] and 
other texts.) For Aj > 0, Wj = > 0 is a natural frequency. 

3.2 The Energy Spaces 

To discuss the damping in (3.1.1) more precisely, to derive the first-order 
form of (3.1.1) and to specify the class of measurements that we will consider. 
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we need to define two additional Hilbert spaces, the strain energy space V and 
the total energy space £, in terms of the basic space H and the stiffness opera- 
tor Ag • We choose a bounded selfadjolnt linear operator Aj on H such that Aj 
Is positive definite on the eigenspace of A Q corresponding to nonpositive eigen- 
values of Ag and the null space of Aj is the closed span of the eigenvectors of 
Ag corresponding to positive eigenvalues. Thus Ag = Ag + Aj is coercive; i.e., 
there exists p > 0 such that 

(3.2.1) <AgX,x> £ p | x | 2 , x e D(Ag) = D(A Q ). 

We define the Hilbert space V to be the completion of D(Ag) with respect to the 
inner product 

(3.2.2) <Vj,V2^y = ^Ag ^1*^2 ^ 

Equivalently, V = D(Ag) and <v lt v 2 > v = <Ag Vj, Ag v 2 > H . 

In the usual way, we will use the imbedding 

(3.2.3) V c h = H' c V' , 

where the injection from V into H and from H 1 into V 1 are continuous with dense 
ranges. We denote by Ay the Riesz map from V onto its dual V 1 ; i.e., 

(3.2.4) <v,Vj> v = (AyVjjv , Vj, v e V. 

Then A Q is the restriction of Ay to D (Ag) in the sense that 

(3.2.5) ( Ay v x ) v = <v,A 0 v 1 > h , v x e D(A Q ) , v e V. 

Remark 3.2. We began our description of the control system model with (3.1.1) 
because its form is familiar in the context of flexible structures. The stiff- 
ness operator Ag, for example, is the infinite dimensional analogue of the 
stiffness matrix in Chapter 2. In applications, though, it is often easier to 
begin with a strain-energy functional from which the correct strain-energy inner 


21 



product is obvious. There is a one-to-one correspondence between the stiffness 

operator A Q and the strain-energy space V. We have seen how V is defined in 

terms of A Q . If V is specified first, then A Q is defined in terms of the Riesz 

~ 1 

map for V by (3.2.5) with D(A q ) = Ay H (see [Shi] also for this approach). 

Either way, the relationship between A Q and V is the same. 

This means that if the space H and its inner product (determined by kinetic 
energy; recall Remark 3.1) and the space V and its inner product (determined by 
strain energy and geometric boundary conditions) are written down, then the 
operators Aq and Ay are determined implicitly. For both our theoretical analy- 
sis and our numerical computation, only the H and V inner products are needed; 
neither A Q , Ay nor Ay 1 need be written down (although it is instructive to 
write the partial differential operator Aq for simple examples, as in Section 
3.3). VVV 

Now we define the total energy space E = V x H, noting that when Aq is coer- 
cive and x(t) is the solution to (3.1.1), then | (x(t) ,X(t)) is twice the 
total energy (kinetic plus potential) in the system. The total energy space is 
the natural space for the infinite dimensional state-space model of the flexible 
structure; i.e., the first order form of (3.1.1), which we will derive in 
Section 3.5. 

3.3 Clamped -Free Beam Example 

For a simple example, we consider the clamped-free Euler-Bernoul 1 i beam in 
Figure 3.1. The length of the beam is 1 and the product of the modulus of 
elasticity and the second moment of the cross section is El. The single control 
force u is spread uniformly over the last five percent of the beam at the right 
end. 
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Figure 3.1: Clamped-free Euler-Bernoull 1 Beam 


For this example, the most natural Hilbert space H is L 2 (0,1), and the 
stiffness operator is 

(3.3.1) A q = El d 4 /ds 4 , D(A q ) = {g e H 4 (0,1): g(0) = g'(0) 

= g" (1) = g" ' (1) = o}. 1 

We model Kelvin-Voigt viscoelastic damping in the beam [CPI], which means that 
the damping operator is 

(3.3.2) Dg = Cg Ag 

where c Q is a positive constant. The actuator influence operator is given by 


l We denote by H k (0,l) the k th -order Sobolev space of functions on (0,1). A 
function f is in this Hilbert space if f, along with its, derivatives of orders 
up through k-l,.are absolutely continuous on [0,1] and f' K ' e L 2 (0,1). The 
square of the H K (0,1) norm of f is 



[d j f(t)/dt j ] 2 dt. 

j-1 0 



0, 0 < s < .95, 


(3.3.3) B q u = u b Q , b Q (s) = 

1, .95 < s < 1. 

Since A Q in this example is coercive, A 1 = 0 and Aq = A Q . The strain-energy 

2 

space V in this example is {v e H (0,1): v(0) = v'(0) = 0}, which follows from 
defining the inner product 


(3.3.4) <v 


r 


v 2 > V 


= <^q v j * 


Vh = 


l 1 


II II, 

Vj v 2 ds. 


V v 2 


D(A n ), 


according to (3.2.2) and completing D(Aq) with respect to the norm induced by 
this inner product. Although the geometric boundary conditions v(0) = v'(0) = 0 
are the only boundary conditions retained explicitly in V, the boundary con- 
ditions v 1 1 (0) = v ' ‘ ' ( 1 ) =0 are retained implicitly by the imbedding of V in H 
and the Riesz map for V because specifying H and V Is equivalent to specifying H 
and Aq, as discussed in Remark 3.2. That H and V determine all of the boundary 
conditions for A Q should remind readers experienced in the structural dynamics 
of applying Hamilton's principle to derive the equations of motion for a distrib- 
uted model of a flexible structure: once the energy functionals and geometric 
boundary conditions are defined, the natural boundary conditions follow from 
integrating by parts in the spatial domain. 


We should note that the coercive, selfadjoint operator Aq exists for this 
example, but that it is not a differential operator. This square-root operator 
cannot be written down in closed form, but its Infinite dimensional matrix 
representation with respect to its eigenvectors is diagonal with the natural 
frequencies of the clamped-free beam on the diagonal. The operators Aq and 
Aq have the same eigenvectors, which are complete and mutually orthogonal in 
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both H and V. 


3.4 The Damping Functional and Operator 

Before we discuss the first-order form of (3.1.1), we will state the damping 
hypothesis that we will use from here on and discuss the representation of the 
damping admitted under this hypothesis. To construct the first-order form of 

(3.1.1) , we do not require an operator D Q defined from some subset of H Into H. 
Rather, we need only the weaker assumption that there exists a damping (or 
dissipation) functional 

(3.4.1) d Q (* ,*) : V x V — > R 

such that dg is bilinear, symmetric, continuous on V x V and nonnegative. Under 
this hypothesis on dg, there is a unique nonnegative, selfadjoint operator 
D y e L(V) such that 

(3.4.2) = <DyV^, V£^y = ^V^2^V’ ^1* ^2 ^ 

If we have a symmetric, nonnegative damping operator Dg defined from 0 (A Q ) 
into H such that Dg is bounded relative to Ag, then <Dq*,*>^ defines a bilinear, 
symmetric, continuous, nonnegative functional on a dense subset of V x V. In 
this case, dg is the unique extension of this functional to V x V and 

(3.4.3) D v I D(A q ) = A 0 D 0* 

(That Dg being Ag-bounded implies continuity of <Dq*,*> h with respect to the V 
norm follows from [Ka2, Theorem 4.12, page 292].) 

In applications, either Dg or d Q should be straightforward to write from the 
physics of the structure, but D y is difficult to determine except for simple 
structures or damping that is a simple function of stiffness. 


25 



Remark 3.3. For our theoretical analysis and our numerical work, the damping 
functional d Q is sufficient; it is not necessary to write down either of the 
damping operators Dy and Dg. (As with the stiffness operator, it is instructive 
to write the damping operators for simple examples; see Section 3.7.) 

3.5 The Semigroup Generator and the First-Order Form of the Equation of Motion 

We want to write (3.1.1) as a well-posed first-order evolution equation on 
the total energy space E = V x H. To do this, we will determine the appropriate 
semigroup generator for the open-loop system, so that Cg-semigroup theory will 
guarantee the existence and uniqueness of solutions to the state-space differen- 
tial equation under appropriate conditions on u(t). We will derive the 
semigroup generator by constructing its inverse explicitly, and then we will try 
to convince the reader that we do have the appropriate semigroup generator. 

This approach seems mathematically efficient, and we will need the inverse of 
the generator for approximation theory. 

We define A -1 e L(E,E) by 

(3.5.1) A' 1 = 

This operator is clearly one-to-one, and its range is dense, since V is dense in 
H and D(Aq) is dense in V. Now, we take 

(3.5.2) A = (A -1 )" 1 . 

Direct calculation of the inner product shows 

(3.5.3) <r l C h ) , (JJ)> E = -dg(h,h). 
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so that A Is dissipative with dense domain. Also, since D (A -1 ) = E, A is maxi- 
mal dissipative by [Gil, Theorem 2.1]. Therefore, ~A generates a Cg-contraction 
semigroup on E. 

Finally, the open-loop semigroup generator is 


(3.5.4) 


A = A + 


0 0 


Aj 0 


D(A) = D (A) , 


where Aj is the bounded linear operator discussed in Section 3.2. With 


(3.5.5) B 


0 

B, 


e L(R m ,E), 


the first-order form of (3.1.1) is 

(3.5.6) z(t) = A z(t) + B u(t), t > 0, 

where z = (x, x) e E. 


To see that A is indeed the appropriate open-loop semigroup generator, sup- 
pose that Ag is coercive (so that A^ = 0) and that we have a symmetric, non- 
negative Ag-bounded damping operator Dg. Then the appropriate generator should 
be a maximal dissipative extension of the operator 


(3.5.7) A = 


0 I 


9 


-A 


0 



D(A) = D(Ag) X D(Ag) . 


It is shown in [Gil, Section 2] that A has a unique maximal dissipative exten- 
sion, and after noting (3.4.3), it can be shown easily that the A(= A) defined 
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by (3.5.1) and (3.5.2) is an extension of A. 


We should note that Showalter [Shi, Chapter VI] elegantly derives a 
semigroup generator for a class of second-order systems that includes the 
flexible-structure model here. The presentation here is most useful for our 
approximation theory because of the explicit construction of the inverse of the 
semigroup generator. For the purposes of this paper, we do not need to charac- 
terize the operator A itself more explicitly, but we should make the following 
points. 

First, from A -1 we see 

(3.5.8) D(A) = {(x,X): k e V, x + O^k e D(A Q )}. 

In applications, the "natural boundary conditions" can be determined from 
(3.5.8) and the boundary conditions included in the definition of D(A Q ) (see 
Section 3.8), although we will not need the natural boundary conditions. In the 
case of a damping operator that is bounded relative to Ag, D(A) = D(A Q ) x V. 

If the damping operator is bounded relative to A^ for y < 1, then A has compact 
resolvent. 

In many structural applications, the open-loop semigroup is analytic, 
although this has been proved only for certain important cases. Showalter 
obtains an analytic semigroup when the damping functional is V-coercive; for 
example, when there exists a damping operator that is both A Q -bounded and as 
strong as Ag. Such a damping operator results from the Voigt-Kelvin viscoelastic 
material model. Also, it can be shown that the semigroup is analytic for a 
damping operator equal to c Q Ajj for ^ < y < 1 and Cg a positive scalar. The 
case y = &, which produces the same damping ratio in all modes, is especially 
common in structural models, and Chen and Russell [CR1] have shown that the 
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semigroup is analytic for a more general class of damping operators involving 
M 0* 

We can guarantee that the open-loop semigroup generator is a spectral opera- 
tor (i.e., its eigenvectors are complete in E) only for a damping operator that 
is a linear combination of an H-bounded operator and a fractional power of A Q . 
However, nowhere do we use or assume anything about the eigenvectors of either 
the open-loop or the closed-loop semigroup generator. The natural modes — of 
undamped free vibration -- in (3.1.2) are always complete in both H and V. 

The Adjoint of the Semigroup Generator. Since D v is selfadjoint on V, 

— * — i * ~_i 

direct calculation shows that A = (A ) -- the adjoint of A with respect to 

the E-inner product -- is 

(3.5.9) A" 1 = 

1 ^ ft 

Then A = (A" ) . Having A - explicitly facilitates proving strong convergence 

for approximating adjoint semigroups. 

Exponential Stability of the Open-Loop System. According to the following 
theorem, a sufficient condition for the open-loop system to be uniformly expo- 
nentially stable is that there be no rigid-body modes and the damping be coer- 
cive. Coercive damping means, basically, that all structural components have 
positive damping. That the decay rate given for the energy norm depends only on 
the lower bound for the stiffness operator and the upper and lower bounds for 
the damping functional is essential for convergence results for the approxi- 
mating optimal control problems of subsequent chapters. 

Theorem 3.4. Suppose that A Q and d Q are H-coercive. Let p be the positive 
constant in (3.2.1), and let S Q and S 1 be positive constants such that 
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(3.5.10) S 0 |v|* £ d Q (v,v) < Sjvly, v e V. 

Then 

s 

(3.5.n) i t ( t> i * ( i + ^ + w 2)> * exp l ]• t!:0 - 

For a proof of this theorem, see [GA1]. The proof uses an explicit Liapunov 
functional for the open-loop system. 

3.6 Representation of the Open-Loop Semigroup In the Case of Uncoupled Modes 
Since the eigenvectors (J»j of A Q are complete in V, x(t) can be written 

co 

(3.6.1) x ( t) = a j( fc ) 
j = l 

where a.(t) is the modal amplitude for the j th mode. The energy space E = V x H 
J 

is isomorphic to the space $ 2 e sequences 

a i 

a i 

a 2 
ao 


that are square summable in the sense of 

(3.6.3) | a | 2 = Y_ a j + a j) < " 

E j-1 

where w. is the j th natural frequency. 

J 
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When a damping operator D Q exists and has the same eigenvectors as A Q , the 
damping leaves the natural modes of free vibration uncoupled and (3.1.1) is 
equivalent to the infinite set of modal equations 

(3.6.4) ayt) + 2 Cj Wj ^(t) + ^ a^t) = B Qj u(t) 
where 

(3.6.5) Cj = <D 0 4> j . <frj> H /2u)j|dij^ 

is the damping ratio of the j th mode and B Q j is a 1 x m matrix. On i 2E , the 
semigroup T(t) is represented by an infinite dimensional matrix with all zeros 
except for 2x2 blocks T.(t) centered on the main diagonal and given by 



3.7 The Measurement Equation and Operators 

Recall that the second-order equation (3.1.1) and the first-order equation 

(3.5.6) are equivalent. We assume that the measurement has the form 

(3.7.1) y(t) = C Q u(t) + C z(t) 

where y(t) is a p-vector, Cg is a real p x m matrix and C is a bounded linear 
operator from E to R p . Since E = V x H, C must have the form 

(3.7.2) C = [Cj C 2 ] 

where Cj and are bounded linear operators from V and H, respectively, to R p . 
This means that (3.7.1) can be written 
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(3.7.3) y(t) = C Q u ( t) + Cj x(t) + C 2 X(t). 

Also, according to Riesz, if (C z(t)). is the i th component of the p-vector 
C z(t) , then 

(3.7.4) (Cz(t)),| = , x^y + ^c^^, X>^, i = 1, ..., p, 

where c^ e V and c 2i e H. 

3.8 Further Comments on the Clamped-Free Beam 

According to Section 3.5, once the basic space H was defined in Section 3.3 
to be l_ 2 (0,l), the first-order form of the equation of motion for the beam in 
Section 3.3 was determined implicitly by the stiffness operator in (3.3.1), the 
damping operator in (3.3.2) and the actuator influence operator in (3.3.3). It 
follows from Theorem 3.4 that the open-loop beam (i.e., the free response of the 
beam) is uniformly exponentially stable for any positive damping coefficient c Q 
in (3.3.2). 

As discussed in Remark 3.2, specifying the strain-energy inner product in 

(3.3.4) and the boundary conditions on the left end of the beam would have been 
equivalent to specifying Ag. Also, according to Section 3.4, specifying the 
dissipation functional to be 

(3.8.1) dg(v^, v 2 ) - Cg (v^, v 2 >y, Vj, v 2 e V, 

would have been equivalent to specifying the damping operator Dg. 

To see the advantage of being able to use the damping functional instead of 
the damping operator, suppose that we attach a linear damper with positive 
coefficient c ^ between the right end of the beam and ground. Then there is no 
damping operator Dg, but it is easy to write the dissipation functional 
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Vj, V 2 € V, 


(3.8.2) dg(Vj, v 2 ) = c Q <v r v 2 > v + c x v 1 (l)v 2 (l), 
since this means that the rate of energy dissipation for the open-loop beam is 

(3.8.3) E = d 0 (x(t), x(t) ) = -c Q j 1 x"(t,s) 2 ds - Cj x(t,l) 2 . 

0 

For none of our purposes in subsequent chapters do we need to go further in 
characterizing the damping. Nonetheless, it is instructive to look at the 
natural boundary conditions that the damping produces and how these boundary 
conditions can be determined with the results discussed so far. In this example 
(and probably only a few others), it is easy to write the operator D v discussed 
in Section 3.4. The part of Dy corresponding to the Kelvin-Voigt damping is c Q 
times the identity in V, which follows from (3.4.3). The part of Dy correspond- 
ing to the linear damper on the end of the beam is less obvious, but it can be 
determined with (3.4.2) and integration by parts. The result (for the total 
damping functional in (3.8.2)) is 

(3.8.4) D v v = c 0 v + v(l) g Q 
where g Q e V is 

(3.8.5) g Q (s) = (1 - s) 3 /6 + s/2 - 1/6, 0 <; s < 1. 

The natural boundary conditions for the equation of motion for the beam 
follow from (3.5.8). With the D(Ag) and V defined in Section 3.3 and Dy given 
by (3.8.4) and (3.8.5), (3.5.8) says that the domain of the semigroup generator 

4 

contains those function pairs (x, X) that satisfy X e V, x + CgXeV = H (0,1), 

(3.8.6) (x + c 0 X)' '(1) =0 
and 

(3.8.7) (x + Cg X) 1 1 '(1) + Cj X(l) = 0. 

These last two equations are the natural boundary conditions. 
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3.9 Hub-Beam-Tip-Mass Example 


One end of the Euler-Bernoul 1 i beam in Figure 3.2 is attached rigidly 
(cantilevered) to a rigid disc which is free to rotate about its center, point 
0, which is fixed. Also, a point mass m^ is attached to the other end of the 
beam. The control is a torque u applied to the disc, and all motion is in the 
plane. 



Figure 3.2. Flexible Structure 


Table 3.1. Structural Data 


r = hub radius 
1 = beam length 

Ig = hub moment of inertia about axis 
perpendicular to page through 0 
m b = beam mass per unit length 
nij = tip mass 

El = product of elastic modulus and second 
moment of cross section for beam 
Fundamental frequency of undamped structure 


10 in 
100 in 

100 slug in 
.01 slug/in 
1 slug 

13,333 slg in 3 /sec 2 
.9672 rad/sec 
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The angle 0 represents the rotation of the disc (the rigid-body mode), 
w(t,*) is the elastic deflection of the beam from the rigid-body position, and 
Wj(t) is the displacement of m 1 from the rigid-body position. For technical 
reasons, we do not yet impose the condition w^t) = w(t,l); more on this later. 

The control problem is to stabilize rigid-body motions and linear (small) 
transverse elastic vibrations about the state 0=0 and w = 0. Our linear model 
assumes not only that the elastic deflection of the beam is linear but also that 
the axial inertial force produced by the rigid-body angular velocity has negli- 
gible effect on the bending stiffness of the beam. The rigid-body angle need 
not be smal 1 . 

For this example, it is a straightforward exercise to derive the three 
coupled differential equations (one partial and two ordinary differential 

I 

equations) of motion in 0, w and w^, and they do have the form (3.1.1 ). 

However, to emphasize the fact that we do not use the explicit differential 
equations, we will not write these equations here. Rather, we will write only 
what normally is needed in applications: the kinetic and strain-energy func- 
tionals, the damping functional and the actuator influence operator. 

Remark 3.1 applies to this example, and to most examples with complex struc- 
tures. The generalized displacement vector is 

(3.9.1) x = (O.w.Wj) e Hg = R x l^O,!) x R. 

The kinetic energy in the system is 

(3.9.2) Kinetic Energy = 1/2 <x,X> H 
where H is H Q with the inner product 
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f 1 

(3.9.3) <x,x> H = m b I [w + (r+s)8] [w + (r+s)0] ds + I Q 00 

0 

+ m 1 [w 1 + (r+l)©]^ + ( r+1 )0 ] . 

As in most applications we need not write the mass operator explicitly, but 
there exists a unique selfadjoint linear operator M Q on H Q such that 

(3.9.4) <x,x> H = <M q x,x> h . 

It is easy to see that Mg is bounded and coercive. Hence H Q and H have equiva- 
lent norms. 

The input operator for (3.1.1 ') (which maps R to Hg) is 

(3.9.5) Bg = (1,0,0). 

Since we multiply (3.1.1 1 ) by Mg X to get (3.1.1), the input operator for (3.1.1) 
is (Mg^Bg). Note that 

(3.9.6) (M^Bg)* 9 = B* , 

where (M^Bg)* 9 is the H-adjoInt of (M^Bg) and Bg is the Hg-adjoint of Bg. 

Remark 3.2 also applies here. The only strain energy is in the beam and is 
given by 

(3.9.7) Strain Energy = 1/2 a(x,x) 
with 

rl 

(3.9.8) a(x,x) = El I w 1 ' w ' ‘ ds, 

0 

where (•)' a d £ (*)/ds . To make a(*,*) into an inner product, we must account 
for rigid-body rotation. Thus we set 
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(3.9.9) <x,x>y = a(x,x) + 09 
and define 

(3.9.10) V = (x = (0,«|>.<t»(1 )):<D e H 2 (0,1), 0(0) = 0'(O) = 0}. 

Also, we have 

(3.9.11) <x,x> y = a(x,x) + <BqBqX,x>Hq = a(x,x) + <(Mq 1 B q ) (M^BqJ^x.x^, 

so that Aj = B o B o’ or ( M o B 0^ M 0 B o' * de P endin 9 on whether the H Q or the H- 
inner product is used in computing the V-Inner product. But we need neither Aj 
nor Aq explicitly. Me need only (3.9.8) and (3.9.9), along with (3.9.3), to 
compute the required inner products. 

As mentioned in Remark 3.2, the operator A Q can be defined now by (3.2.5), 
and the stiffness operator is A Q = A Q - A^. Using the H Q -inner product in 
(3.2.5) yields the A Q for (3.1.1*), and using the H-inner product yields the A Q 
for (3.1.1), which is M" 1 following the A Q for (3.1.1* ). The A Q for (3.1.1 ' ) 
is simple, and the reader might write it out. We will not, because we do not 
need it. 

We will point out that D(Ag) requires both the geometric boundary con- 
ditions in V and the natural boundary condition w ' ' ( t , 1 ) = 0; i.e., zero moment 
on the right end. That the geometric boundary conditions 

(3.9.12) w(t,0) = w'(t,0) = 0 
and 

(3.9.13) w(t,l) = Wj(t) 

are imposed in V but not in H — i.e., on the generalized displacement but not 
on the generalized velocity -- is common in distributed models of flexible 
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structures. The natural norm for expressing the kinetic energy of distributed 
components is the norm, which cannot preserve constraints on sets of zero 
measure. Because the strain energy involves spatial derivatives, the stronger 
strain-energy norm can preserve the geometric boundary conditions (although, as 
for the boundary slope of an elastic plate, the V-norm may impose some of these 
boundary conditions in an rather than a pointwise sense). The strain-energy 
norm is based on the material model of the distributed components of the system, 
and it should not be surprising that such a norm is required to connect the 
various structural components. 

We assume that the beam has Voigt-Kelvin viscoelastic damping, so that the 
damping operator in (3.1.1) is 

(3.9.14) Dg = Cg Ag 

where Cg is a constant. This means that the damping functional is 

(3.9.15) dg(x,x) = Cg a(x,x) , x,x e V. 
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4. Approximation of the Distributed Model of the Structure 

4.1 Abstract Finite Element Approximation 

We begin this chapter by discussing a Ritz-Galerkin approximation framework 
that accommodates most common finite element schemes for flexible structures. 

The only hypotheses for this abstract approximation theory (in addition to those 
given in Chapter 3 for the infinite dimensional structural model) are the 
requirements stated in Hypothesis 4.1 for the basis vectors. As indicated in 
Section 4.1.2, modal approximation fits easily into this framework. 

We assume that the basic space H is given, that the various operators and 
functionals associated with (3.1.1) satisfy the hypotheses stated in Chapter 3 
and that the energy spaces V and E = V x H are defined as in Section 3.2 of 
Chapter 3. 

Hypothesis 4.1. There exists a sequence of finite dimensional subspaces V p of V 
such that the sequence of orthogonal projections Py n converges V-strongly to the 
identity, where Py n is the V-projection onto V n . Also, each V n is the span of n 
linearly independent vectors e.. VVV 

J 

Since it should cause no confusion, we will omit the subscript n and write 
just ej, keeping in mind that the basis vectors may change from one V n to 
another, as in most finite element schemes. Also, we will refer to the Hilbert 
space E n = V p x V n , which has the same inner product as E = V x H. 

4.1.1 Approximating Equations of Motion 

For n £ 1, we approximate x(t) (the solution to (3.1.1)) by 
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(4.1.1) 


n 

x n (t) - £ Sjltlej 
j = l 

where £(t) = [^(t) € 2 (t) ... € p (t)] T satisfies 

(4.1.2) M n £ + D n £ + K n £ = bJu, 

and the mass matrix M n , damping matrix D n , stiffness matrix K n , and actuator 
influence matrix Bg are given by 

M n = [<e.,e.> H ], D n = Cd Q (e 1 ,e .)], 

(4.1.3) K n = C<Ag /2 e i ,Ag 7 ^ e.> H ] = [<e i ,e j > v ] - [<A 1 e. ,ej> H ], 

Bg = [<e i 

Of course, (4.1.2) can be written as 

(4.1.4) f| = A n n + B n u 

where 

(4.1.5) n = (*) 
and 

n ° I 1 n 0 

(4.1.6) A n = , B n = 

-M" n K n -M" n K n M' n Bg 

Note 4.2. Throughout this text, we use the superscript n in the designation of 
matrices in the n^ approximating system and control problem, like A n , B n , M n , 
etc. Hence the superscript n indicates the order of approximation — and it 
never indicates a power of the matrix. By M _n , we denote the Inverse of the 
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mass matrix M n . In the designation of a linear operator in the n th approxima- 
tion, we use the subscript n. For example, A n and B n are the operators whose 
matrix representations are A n and B n , respectively. VVV 


In the class of approximation schemes considered here, the approximation to 
the equivalent measurement equations (3.7.1) and (3.7.3) can be written 


(4.1.7) y n = C Q u + C n n = C Q u + C n (*) 

with 


(4.1.8) 



where the i th column of the p x n matrix is the p-vector equal to and 
the i th column of the p x n matrix is the p-vector equal to C^. The y n 
in (4.1.7) would be equal to the exact measurement if the true generalized 
displacement vector x(t) were a linear combination of the first n basis vectors 
for all t ( i . e . , if x(t) were equal to the x n ( t) in (4.1.1)) so that we could 
take E = E n . 


4.1.2 Modal Approximation 

The equations in Section 4.1.1 include the important case where the basis 
vectors e. are the mode shapes of undamped, free vibration (i.e., the eigenvec- 

J 

tors of the stiffness operator A Q ; recall (3.1.2) and (3.1.3)). In this case, 
the mode shapes usually are normalized with respect to mass so that the mass 
matrix M n is the n x n identity matrix. The corresponding stiffness matrix then 
is 
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0 


(4.1.9) 


K = 




w. 


0 ) 


2 

n 


where the w^'s are the natural frequencies of undamped, free vibration (i.e., 
the diagonal elements of K n are the first n eigenvalues of Ag). A zero fre- 
quency corresponds to a rigid-body mode. 


That Hypothesis 4.1 holds for modal approximation follows from the fact that 
the operator A Q | v is a compact selfadjoint operator on V, since A Q and Ag 1 
have the same eigenvectors. 


4.2 Convergence 

It is useful to note that (4.1.1) and (4.1.2) or (4.1.4) are equivalent to 
(4.2.1) z n (t) = A n z n (t) + B n u(t), 

• m 

where z p = (* n ,x n ) e E n and A p e L(E p ) and B p e L(R ,E p ) are the operators whose 
matrix representations are given in (4.1.6). Also, for any real A, 



is equivalent to 

(4.2.3) (A 2 M n + AD n + K n )a 1 = (AM n + D n )3 1 + M n 3 2 
and 

(4.2.4) a 2 = Aa 1 - 0 1 
if 


42 



(4.2.5) 


j » 1,2. 


n n 

= ) <r?e. and = ) p^e., 

n L 11 n i i’ 

i=l i=l 

(Substituting A n and (4.2.5) into (4.2.2) yields (4.2.3) and (4.2.4).) 

Next, we will prepare to invoke the Trotter-Kato semigroup approximation 
theorem to show how (4.1.2), (4.1.4) and (4.2.1) approximate (3.1.1) and 

(3.5.6) . For this, we will treat only the case in which A Q is coercive (no 
rigid-body modes), so that Aj = 0 and A Q = A Q ; the general case is a straight- 
forward extension. In the present case, then, the open-loop semigroup generator 
A is maximal dissipative, and for each n, A n is dissipative on E n . The main 
idea here is to project (A-A) -1 onto V n in a certain inner product and observe 
that the result is exactly (^-A n ) _1 , where A p is the operator on V n in (4.2.1) 
and (4.2.2). Of course, we need only do this for real A > 0. 

For real A > 0, then, we define an inner product on V by 

(4.2.6) < *’* > A = + »’) + < *»* > v 


Under the hypotheses in Chapter 2 on d Q , <*»*>^ is clearly norm-equivalent to 
<*,*> v . For n > 1, we let P n (A) be the projection of V onto V n in the inner 
product If h 1 , h 2 e H, 


(4.2.7) 



is equivalent to 


(4.2.8) 
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With A -1 from (3.5.1), (4.2.8) is equivalent to 

(4.2.9) (I + AD V + = (AAg 1 + D^h 1 + A'V 

and 

(4.2.10) v 2 = Av 1 - h 1 . 

If 

(4.2.11) vj = P^Ajv 1 and v 2 = P n (A)v 2 , 

it follows from (4.2.6) and (4.2.9) that 

(4.2.12) <e i* v J > A = <e i > yl> A 

= A <e^ > AqAq v >j_| + A<e. ,DyV >y + <e. ,v >y 

= <e i ,(A 2 Ag 1 + ADy + Ov^y 
- <e i ,(AA' 1 + Dyjh 1 + AjjV^ , 

and from (4.2.10) that 

(4.2.13) <e i ,v 2 > = <e i ,v 2 > = A<e i ,v 1 > A - <e 1 ,h 1 > A . 

Now, for h 1 = hj e V n> h 2 = h 2 e V n , and vj, v 2 , hj and h 2 written as in 
(4.2.5), (4.2.12) and (4.2.13) yield (4.2.3) and (4.2.4) again. 

This shows that 
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(4.2.14) 


P„(M 0 


0 P n (A) 


(A-A) _1 | e . (A-A r 1 , 
n 


which yields 


(4.2.15) 


p „(M 0 


0 P„(A) 


(A-A) _1 P e - (A-A n )" l P E , 


where P F is the E-projection of E onto E . The projection P F can be written 


(4.2.16) P p = 

where P.. is the V-projection onto V„, as before, and P u is the H-projection 
v n n H n 

onto V n> Since the V-norm is stronger than the H-norm, it follows from 

Hypothesis 4.1 that (A-A n ) _1 Pp converges E-strongly to (A-A)" 1 as n — > -. 

n tp 

Now, with A extended to E^; as, say, n(P F -I), Trotter-Kato [Ka2, page 504, 
n n t n 

Theorem 2.16] yields the following. 

Theorem 4.3. For Ag coercive, let T p (t) be the (contraction) semigroup gener- 
ated on E n by A n « Then, for each t > 0, ^ n (t)P^^ converges strongly to T(t), 
uniformly in t for t in any bounded interval. VVV 



In the general case, when A Q is not coercive, the open-loop generator A is 
obtained from the dissipative A by the bounded perturbation (3.5.4), so that 
[G13, Theorem 6.6] yields the following generalization of Theorem 4.3. 

Corollary 4.4. Let T n (t) be the semigroup generated on E n by A n< Then, for 

each t > 0, T (t)P F converges strongly to T(t), uniformly in t for t In any 
n tp 

bounded Interval. VVV 
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Theorem 4.5. When A has compact resolvent, (A-AJ _1 P,- converges in L(E) to 

n t n 

(A-A)" 1 . 

Proof. This follows from (4.2.15) and a standard result that the projection of 

a compact linear operator onto a sequence of subspaces converges in norm if the 

projections converge strongly to the identity, as do P F and P (A). 777 

b n n 

That the adjoint semigroups also converge strongly follows from an argument 
entirely analogous to the proof of Theorem 4.3. In particular, equations like 

(4.2.6) -(4.2.12) are used to show that 

[ p n< A > 0 

(4.2.17) (A-A*f P E = (A-A*) P E * 

0 P (A) n n 

_* _1 

In showing this, A is used as A was used above. Also, care must be taken to 
calculate A* with respect to the E-inner product. The result is 

Theorem 4.6. Let T n (t) be the sequence of semigroups in Corollary 4.4. Then, 
for each t > 0, T*(t)P c converges strongly to T*(t), uniformly in t for t in 

n t n 

any bounded interval. 777 

For the approximation to the actuator influence operator B e L(R m ,E), recall 
B n e L(R m ,E n ), the operator whose matrix representation is the matrix B n in 

(4.1.6) . From (4.1.3), it follows that 

(4.2.18) B n = P^B. 

Since B has finite rank m, B n and B* converge in norm to B and B*, respectively. 

We define the approximating measurement operator C p e L(E n , R p ) to be the 
operator whose matrix representation is the matrix C n in (4.1.8) (defined imme- 
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diately after (4.1.8)). This means that 


(4.2.19) C n - C| En . 

where C is the operator in (3.7.1)-(3.7.3). Since C is a bounded linear opera- 
tor with finite rank, C„P,- and C* converge in norm to C and C*, respectively. 

n t n n 

4.3 Approximation of the Hub-Beam-Tip-Mass Example 

Our approximation of the distributed model of the structure in Section 9 of 
Chapter 3 is based on a finite element approximation of the beam that uses cubic 
Hermite splines as basis functions [Scl, SF1]. Cubic Hermite splines and their 
first derivatives are continuous at the nodes, which are evenly spaced for the 
numerical results in Chapter 9. Because the basis vectors e. in Hypothesis 4.1 

J 

must be in the space V defined in (3.9.10), we write them as 
(4.3.1a) e x = (1,0,0), 

(4.3.1b) e. = (0, d>j , 0j(l)), j = 2, 3, .... n, 

where the 0 j 1 s are the cubic spline functions defined over the length of the beam 
and n-1 is the number of splines. For n g elements, there are 2n g linearly inde- 
pendent splines. The 2n g elastic degrees of freedom are usually taken to be the 
displacements and slopes at the nodes, relative to the rigid-body position. 

With the rigid-body mode (hub rotation), the total number of degrees of freedom 
is n » 2n 0 + 1. 

The matrices in (4.1.3) are calculated according to (3.9.3), (3.9.8) and 
(3.9.9), with Bg given by (3.9.5). In particular. 
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K n - [a(e j ,e J )] 1 t>" » c Q tc\ 

(4.3.2) 

Bq = Cl o 0 0] T 

= [<e i ,MQ 1 (l,0,0)> H ] = [< ei ,(1,0,0)> H ] 

Note that the first row and column of K n are zero. The matrices A n and B n are 
given by (4.1.6). 

If the measurement is the rigid-body angle 0, then the measurement matrices 
in (4.1.7) are C Q = 0 and 

(4.3.3) C n -- [1 0 0 0 ... ]. 
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5. Order Reduction Using Balanced Realizations 
5.1 Introduction 

Almost any engineering analysis and design task requires a model of a phys- 
ical system. The model should be complete enough to describe all of the rele- 
vant physical phenomena, but not so complicated that insight is lost or 
computational burdens become excessive. The development of an appropriate model 
is largely an art which depends on many factors such as the distribution and 
type of inputs and outputs, the magnitude and frequency content of inputs and 
outputs and the structure and properties of the physical components making up 
the system. Trial and error and engineering intuition are common ingredients in 
the modeling process. Much research has focused on methods for determining what 
is essential about a model and what can be disregarded. Most of this work, 
generally known as model reduction, applies to systems where the input-output 
relationship is linear. Fortunately, the structural systems considered here 
fall into that class. 

In addition to generating a low order model of the system which is to be 
controlled, one may wish to reduce the order of the controller or compensator 
which has been synthesized to accomplish the control. If the controller is 
linear, one may often use the same methods which are used for reduction of the 
plant model. This chapter deals with an approach for model reduction of linear 
systems which has been widely used since its introduction in 1980, the method of 
balanced realizations [Mol]. In its general form, the approach may be used for 
reduction of a plant model or controller. A simplified approximate version may 
be used when the equations have a special form corresponding to lightly damped 
mechanical systems. The method has intuitive appeal and is easy to apply. 
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A number of methods for model reduction have been used in the past with 
varying degrees of success. Modal truncation, for example, has been widely used 
in structures research. In this approach, the equations of motion are trans- 
formed into a set of uncoupled second order equations each of which describes 
the dynamics of one mode of vibration. Model reduction is accomplished by 
deleting those modes with the highest damped or undamped natural frequencies. 

The coupling of each mode to inputs or outputs does not play a role in the trun- 
cation procedure. The method is straightforward in its application and has 
intuitive appeal. It is an example of a method where reduction is accomplished 
by truncation after an appropriate coordinate transformation. 

Direct model reduction procedures have been proposed which do not involve 
truncation. In these approaches, the analyst decides in advance what order of 
reduced model he wants and then uses a numerical optimization procedure to 
obtain the reduced model of the given order which minimizes a specified measure 
of model accuracy. Methods of this type have recently been developed by Hyland 
and Bernstein for application to reduced order controller design [HB1], Their 
approach is called the Maximum Entropy Method, and it involves the simultaneous 
solution of two modified Riccati equations and two modified Liapunov equations. 
Numerical routines for solving these equations are not yet standard, but the 
authors have developed algorithms which appear to be effective. 

The introduction of the method of Balanced Realizations in 1980 revitalized 
the field of model reduction and led to renewed research. Various modifications 
and extensions have been proposed and studied. Enns developed an extension for 
frequency weighted balanced realizations [Enl] whereby the analyst could modify 
the basic balanced realization procedure to obtain relatively better reduced 
model accuracy in certain frequency ranges where model accuracy was critical. 
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Another approach to model reduction is the optimal Hankel norm approximation 
which is developed in great detail by Glover et al. in [GLl.Gll], Hankel norm 
approximations are closely related to approximations based on balanced realiza- 
tions, and in fact begin by expressing the system as a balanced realization. 
Although a smaller error bound can be established for Hankel norm approxima- 
tions, the computations required are substantially more involved than those for 
balanced realizations. Another feature of the Hankel norm approximation is that 
the reduced model may not be strictly proper even when the original model is. 

It is possible to alter the computations to ensure a strictly proper reduced 
model, but the required changes increase the error bound. 

New techniques for model reduction continue to emerge. Five model reduc- 
tion techniques (including balanced realizations, Hankel norm approximations, 
an approach by Davis and Skelton, an approach by Yousuf f and Skelton and a new 
method by Liu and Anderson) have been compared recently by Liu and Anderson 
[LAI]. Although some of the methods generally performed better than others, 
there was no one method which was always superior. Thus, model reduction is 
still an area of active research. The present chapter does not attempt to exa- 
mine and compare the many model reduction techniques currently available. 
Instead, its scope is limited to the method of balanced realizations. This 
approach has been found to compare favorably with other methods, and its com- 
putational simplicity makes it attractive for present applications. 

5,2 Observability and Controllability; Observability and Controllability 

Gramm Ians 

Balanced realizations are based on the notions of controllability and 
observability of linear systems. Thus we begin by defining these terms and 
exploring their meaning. Suppose a physical system is described by the state 
space equations: 
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(5.2.1) k = Au + Bu 

(5.2.2) y = Cx 
where 

x = n x 1 state vector 
y = r x 1 output/measurement vector 
u = m x 1 input/disturbance vector 

A, B and C are constant matrices of appropriate dimension. 

To introduce the notion of controllability, one might ask the following 
question: "Can one find an input u ( t ) which drives the system (5.2.1) from an 

initial state x(t Q ) = 0 to an arbitrary final state x f in finite time t f ?" If 
the answer to this question is yes, the system is said to be controllable. 
Otherwise it is said to be uncontrollable. To determine whether a system is 
controllable, first note that if x(t Q ) is zero, then 


(5.2.3) 


x(t) 


tf e A(t ~ T) B u(t)di 

. 

t 0 


Introduce a symmetric matrix W c (t Q ,t) defined as: 

‘ e Mt-T) BB T/(t-T) dt 
0 

W c (t 0 ,t) is called the controllability gramian. If W c (t 0 ,t f ) is nonsingular, 
a control Input u Q ( t ) which drives x from the origin to x f at time t f is given 
by 


(5.2.4) W c (t Q ,t) £ 


(5.2.5) 


T A (t--t) . 

u 0 (t) = Be W c( t 0 ,t f) x f 


This can be verified by substituting (5.2.5) into (5.2.3). Thus, if W c (t Q ,t f ) 
is nonsingular, the system described by (5.2.1) is controllable. In the event 
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that W c (tg,t f ) is singular, the system is said to be uncontrollable. 

The control input given by (5.2.5) is not the only one that will drive x 
from the origin to at time t f . Let Uj(t) be an input which drives x from the 
origin through an arbitrary trajectory and back to the origin at time t f . 

Because of the linearity of (5.2.1), any input which drives x from 0 to x^ at 
time t f can be written as 

(5.2.6) u(t) = u q ( t ) + Uj(t) 

Among all such inputs, Ug(t) is the one which minimizes the integral 

(5.2.7) E = j tf u T (t)u(t)dT 

t 0 

E may be regarded as a measure of control effort, and Ug(t) interpreted as the 
minimum effort input which accomplishes the desired change of state from 0 to 
x f . Substituting u Q (t) into equation (5.2.7) and evaluating the integral yields 
the minimum value of E. 

( 5 *2* 8 ) E min = x f ^W^f 

Three observations will be useful for subsequent work. 

Observation 1 

The control effort becomes small (i.e., the system becomes easier to 
control) when W c (tg,t f ) becomes large. 

Observation 2 

VI (tg.tf) can be expressed as: 
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(5.2.9) W c ( t 0 , t f ) = | tf ° e AA BB T e A A dA 

0 

To see this, let \ = t^. - t in (5.2.4). 
Observation 3 


If a system is controllable, it can be driven from an arbitrary initial 
state to an arbitrary final state. This can be seen by starting from an 
arbitrary initial state Xg, and expressing the final state as 


(5.2.10) x(t f ) = e 


f t f " v "f 


Xg + 


A(t f -t) 

e B u(T)dx 


So, 

(5.2.11) 


x(t f ) - e 


A(t f -t 0 ) 


x 0 = 


r l f A(t f -T) 


B u(x)dt 




This shows that the control which drives the system from Xg to x(t^) is the same 
as that which drives the system from the origin to the final state x f where 


(5.2.12) 


x f = x(t f ) - e 


A(t f -tg) 


The concept of controllability has several things in common with that of 
observability, but it has some differences as well. Assume A, B and C in Eqs. 
(5.2.1, 5.2.2) are known, and that the input u(t) and the output y(t) are known 
on tg < t 5 t f . Then ask the question, "Can one determine the initial state x Q 
from this information?" If the answer to this question is yes, the system is 
said to be observable. If the answer is no, the system is unobservable. 
Observability also implies that knowledge of the input and output over any 
interval t Q < t £ t^. is sufficient to determine x(t f ). 
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To develop the idea of observability, let 


(5.2.13) 


(5.2.14) 
Then, 

(5.2.15) 


x(t) S x(t) - 


1 e A(t_l) B u(T)dt 


y(t) S C x(t) 


ft 


y(t) = y(t) - 


C e A(t_T) B u(t)dt 


Since both terms on the right side of (5.2.15) are known, y(t) may be regarded 
as known also. Now, 

x = x-Bu-a| e A ^ t- ^B u(x)dt 


= Ax + Bu - Bu - A 


ft 


e A ( t x )b u(i)dx 


(5.2.16) 

and 


= A 


x - 


ft 


L 0 


e A(t- ^B u(t)di 


= A x 


(5.2.17) x(t Q ) = x(t Q ) 

Hence, the plant equations may be written in terms of x and y as 

(5.2.18) x = Ax, = 

(5.2.19) y = Cx 

The observability of (5.2.18) and (5.2.19) is the same as that of (5.2.13) and 
(5.2.14) since the ability to determine x(t Q ) given y(t) on [tQ.tj] implies and 
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is implied by the ability to determine x(t Q ) given y(t) on [ t Q , 1 1 ] and u(t) on 
[ ’ 1 1 ] * In the subsequent development we will work with (5.2.18) and (5.2.19) 
because of their simpler form. 

From (5.2.18) and (5.2.19), y(t) may be written as 


A(t-t Q ) X 

(5.2.20) y(t) = C e u x(t Q ) 


t < t < t f 


Define the observability Grammian as 


(5.2.21) W o (t Q ,t) £ 


e C C e dx 


A T (t-t Q ) 

Multiply (5.2.20) by e C and integrate from t Q to t f . 


(5.2.22) 


f 4 f A T (x-t ) 

e u C y(x)dx = 


^f A T (x-t n ) 


Q/ T A(l-tg) 

u c'c e u dt x(t 0 ) 


= ^o^O’^f ^ 

If W 0 (t Q ,t f ) is nonsingular, and y(t) is known on [t Q ,t f ], then (5.2.22) can be 
solved for x(t Q ). 


(5.2.23) 


= W o ^0’^ 


^f A T (x-t Q ) 
e 

fc 0 


C T y(x)dx 


Hence, the system (5.2.18), (5.2.19) is observable if W 0 (tg,t^) is nonsingular. 
If W 0 (t Q ,t f ) is singular, the system is unobservable. 


For interpretation of results in the sequel, it will be useful to introduce 
a scalar measure of the output y(t) analogous to the scalar measure of the input 
given in Eq. (5.2.7). Define a response function R as 
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(5.2.24) 


R = 


r t f 


y (x)y (T)dt 


Substituting from (5.2.20) and (5.2.21), 


y T (t) y(t) dt • x T (t 0 ) 


f ‘f A T (x-t 0 ) T A(t-t„) 

e u C C e u dt x(t Q ) 

0 


(5.2.25) 


= x T (t 0 ) W Q (t 0 ,t f )x(t 0 ) 


Thus, W Q (t 0 ,t f .) provides a measure of the size of the response corresponding to 
an initial condition x(t Q ). 


If the A matrix in Eq. (5.2.1) is stable (i.e., all of the eigenvalues of A 
are in the left half plane), then w c (tg.t f ) and W 0 (t Q ,t f ) remain finite as 
t^ ■+ -. Define 


(5.2.26) W c = W c (0,-) = | e At BB T e A T dx 

0 

(5.2.27) W Q = W o (0,») = | e A T C T Ce At dt 

0 

It is not difficult to show that W c and W Q satisfy the Liapunov Equations: 

(5.2.28) A W c + W c A T + BB T = 0 

(5.2.29) w 0 A + aT W o + cTc = 0 

Since well documented numerical procedures are available for solving these 
equations [BS1], finding and W Q for the case where t f ■+ • is most easily 
accomplished by solving (5.2.28) and (5.2.29) rather than attempting direct 
evaluation of the integrals given in Eqs. (5.2.9) and (5.2.21). If the given A 
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matrix is not completely stable, then A must be partitioned into a stable part 
and an unstable part. Eqs. (5.2.28) and (5.2.29) may be used to find the 
controllability and observability grammians associated with the stable part of 
A in the infinite time case. 

5,3 Balanced Realizations 

It is always possible to express the system equations (5.2.1) and (5.2.2) 
in a different but equivalent form through the use of a coordinate transfor- 
mation: 

(5.3.1) x = T x 

where T is a nonsingular matrix. This transformation leads to 

(5.3.2) x = Ax + B u 

(5.3.3) y = Cx 

where 

(5.3.4) A = T -1 A T, B = T _1 B, C = C T 

There are many objectives one might have in performing such a transformation, 
e.g., to put the transformed system into diagonal or block diagonal form, to put 
it in controllability canonical form [Kal, pp. 49-55], etc. When the trans- 
formed model is to be used for control system design, the Balanced 
Realization of Moore [Mol] is appealing because expressing equations in this 
form allows one to compare the relative controllability and observability of 
individual states. This information is useful in deciding which states are most 
important to retain in a reduced model. 

Suppose the original system matrix A is stable. (If this is not the case, 
then A must be partitioned into a stable and unstable part as noted above. The 
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discussion to follow applies to the stable part.) To cast the equations into 

balanced form, we seek a transformation matrix T such that W„ and W„ in 

c o 

Eqs. (5.2.26) and (5.2.27) are diagonal and equal. Suppose W c and W Q are found 
by solving the Liapunov Equations (5.2.28) and (5.2.29) for the untransformed 
system. R c and R Q for the transformed system then become 


(5.3.5) 0 = T _1 W T -T ; Q = T T W T 

The transformation matrix T is found by considering the eigenvalue problem 

(5.3.6) [W Q - AW' 1 ]Y = 0 

Assuming distinct eigenvalues, the eigenvectors satisfy 

(5.3.7) y]w- 1 Y i = 0; yjw^ =0 i * j 

Form a square matrix, r, from the eigenvectors of (5.3.6). The matrix r will 
satisfy 

(5.3.8) r T W Q r = diagonal, r" 1 W c r~ T = diagonal 

We have reached part of our goal by finding a matrix which diagonalizes both 

W c and W Q . However, the columns of r are not yet uniquely defined because we 

have not specified a normalization. This remaining degree of freedom is used to 

make W„ and equal. Normalize the columns of r such that 
c o 


(5.3.9) 
Then, 

(5.3.10) 


Y lVl ■ <»1> 


14 


W = W n = 
C 0 


i “ 1,2,.. .,n 


0 


Vi 


The A^'s are called Hankel singular values. Thus, when the original system is 
transformed using T = r, observability and controllability grammians satisfy 
(5.3.10). The transformed system is said to be a balanced realization. 
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5.4 Model Reduction Using Balanced Realizations 


We now consider the interpretation of R c and t? Q and examine how these 
matrices can be used to motivate a model reduction procedure. For a balanced 
realization, t? c and W Q are diagonal and satisfy (5.3.10). For this situation, 
the control effort function E and the response function R introduced in section 
5.2 become 

n x 2 . n 

(5.4.1) E = | u T u dx.£^; R = J y T y dx = x 0iN 

0 i=l 1 0 1=1 

If A i > A j , we can argue that less effort is required to move from the ori- 
gin to x f . than to move from the origin to x^. Thus, x^. is more strongly 
affected by u than x f j, i.e., x^.. is more controllable. Similarly, we can argue 
that an initial condition on Xg. results in a larger contribution to the 
response than an initial condition on Xgj , i.e., Xg. is more observable than 
Xgj. Taking these observations together, we see that a state x. with a large 
A^ is more controllable and observable than a state Xj with a small A y If 
model truncation is contemplated, it is intuitively appealing to retain highly 
controllable and observable states, i.e., states with large A's. Thus states 
corresponding to small values of A are candidates for truncation. If there is 
no gap separating the states with large A's from those with small A's, it may 
not be clear where the division should occur between the retained states and the 
discarded states. This issue may be resolved by considering the convergence of 
control and estimator gains. See Chapters 6-9 and [ MG 1 ] . 

5.5 Asymptotic Expansions for the Balanced Singular Values of Lightly Damped 

Mechanical Systems 

As explained in Section 4.2, calculating W c and W Q involves the solution of 
Liapunov equations. When Eqs. (5.2.1) and (5.2.2) describe a stable, lightly 
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damped mechanical system, the approximate solution of these Liapunov Equations 
may be accomplished using perturbation methods. This section concerns the 
development of first and second order perturbation solutions for W c , W Q and 
their balanced singular values. 

The mechanical systems addressed in this section are described by second 
order equations of the form: 

(5.5.1) Mq+eDq+Kq=bu 

(5.5.2) y = c p q + c v q 

M, D and K are symmetric and positive definite n x n matrices with M being the 

mass matrix, D the damping matrix and K the stiffness matrix. (At the expense 

of some additional complexity, the assumptions that D and K are symmetric could 
be relaxed in the development to follow. Ref. [BM1] addresses the case where a 
skew symmetric gyroscopic matrix G is added to D and a skew symmetric cir- 
culatory matrix F is added to K. The present development is restricted to the 
case where G and F are absent.) The matrix b is the input distribution matrix, 
and u is the input vector representing either the control or disturbance input. 
The output vector is y, and c p and c y are the position and velocity distribution 
matrices, respectively. The scalar e is a small parameter reflecting the light 
damping. By an appropriate coordinate transformation, these equations can be 
cast into a more convenient modal form: 

(5.5.3) n + e A fj + fi 2 n = B u 

(5.5.4) y = c p n + c v n 

In Eqs. (5.5.3) and (5.5.4), ft is a diagonal matrix of modal frequencies. The 
modal frequencies are assumed to be distinct. The damping matrix, A, is sym- 
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metric and positive definite, but it is not presumed to be diagonal. The matri- 
ces 0, C p and C v are the input and output distribution matrices for the 
normalized system. For the development to follow, we wish to express Eqs. 
(5.5.3) and (5.5.4) in state space form. Let 


(5.5.5) 




C - [c p 


B = 


0 

0 


Then the state space version of (5.5.3) and (5.5.4) becomes: 

(5.5.6) x = Ax + Bu 

(5.5.7) y = Cx 


Consider first the solution of the Liapunov Equation (5.2.28) for W c . Because 
of the special structure of A and B, it is useful to express W c in the par- 
titioned form: 


(5.5.8) 






W 22 

6 


The scalar e is introduced in Eq. (5.5.8) for convenience in the subsequent 
development. Substitution of (5.5.5) and (5.5.8) into (5.2.28) leads to three 
matrix equations: 


(5.5.9) 

W 12 + W 12 = 0 

(5.5.10) 

W 22 “ W 11 fi2 = eZ W 12 A 

(5.5.11) 

-(n 2 w 12 + w| 2 n 2 ) - (a w 22 + w 22 a) + bb t = o 
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These equations lead to some interesting observations. First we see that 
W 12 is skew symmetric. This results from the fact that the state vector x has 
the special form x = [n^ n^]^» i.e., the bottom half of the state vector is the 
derivative of the top half. 

A second observation is that Eqs. (5.5.9-5.5.11) contain e but not e. 

2 

This means that W^, are functions of e and expansions of these matrices 

2 

should be in terms of e and not e. 

With these observations in mind, we assume an expansion for W.^ in the 

form: 


(5.5.12) 

w u 

= U 1j+ £ 2 v j;i + e 4 w (J + ... 

Substituting 

this 

expansion into Eqs. (5.5.9-5.5.11) and : 

like powers 

of e 2 

to zero leads to a set of equations whii 

sequential ly 

• 


e° terms: 



(5.5.13) 

“12 

+ u 12 = ° 

(5.5.14) 

u 22 

- u n Q 2 = 0 

(5.5.15) 

-(n 

2u 12 + u 12 fi2 ^ ' ( Au 22 + U 22 A ^ + “ 0 


e terms: 


(5.5.16) 


v 12 + v 12 = 0 
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(5.5.17) 


e terms: 


v 22 " v li n = u 12 A 


(5.5.18) 


-(fi Z v 19 + vl 9 n 2 ) - (Av„ + v„A) = 0 


12 T 12 


22 T 22 


(5.5.19) 

w 12 + w 12 = 0 

(5.5.20) 

2 

W 22 " w ll^ + w 12 A = 0 

(5.5.21) 

-(Q 2 w 12 + w{ 2 fi 2 ) - (Aw 22 + w 22 A) = 0 

etc. 



Define u> 2 as the i th diagonal element of fi 2 , and A^ and (u k£ ).j 
ij elements of 33 T , A and u k£ , respectively. Then, using Eqs. (5.5.13 - 
and the assumption that the modal frequencies are distinct, the elements 
u^ become: 


as the 
5.5.15) 
of 


(5.5.22) (Ujj)^ = (38 T ) ii /(2u) 2 A ii ) ; (u^j =0 for i * j 
(u^i is diagonal.) From (5.5.22) and (5.5.14), U 22 becomes: 

(5.5.23) U 22 = u^fi 2 


(u 2 2 is also diagonal .) 

With the elements of u 22 known, introduce a matrix P defined as: 

(5.5.24) P = 83 T - (Au 22 + u 2 2 

(Note that P.^ = 0.) The elements of u 12 then become 
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(5.5.25) 


i * J 


( u 12)l j = P ij( w i ' w j) ; 

(5.5.26) (Ui 2 >ii = 0 
Next introduce matrices S and T based on the known solution for u 12 . 

A T 

(5.5.27) S = u 12 A - Au 12 

(5.5.28) T = -(A 2 Uj 2 + u 12 A 2 ) 


Then, from (5.5.16 - 5.5.18), the elements of become 


(5.5.29) (v u ),j = Slj A«f - «j) I 


1 * J 


(5.5.30) 


^ V 1 1 ^ i i = 


i -1 11 

T ii ‘ 2 H ( A ik w k( v ll)ki) " 2 JZ ^ A ik w k^ v ll\i^ 

k=l 


k=i+l 


2A ii w i 


Using this expression together with (5.5.17) we obtain 


(5.5.31) 


v 22 = v ll fi + u 12 A 


The expansion will not be carried beyond this point because the expressions 
become increasingly unwieldy. However, the expressions obtained so far permit 
one to write out the first three terms in the expansion for W £ . To see this, 
recall that 


(5.5.32) W c = 



W 12 


e 


w T 

w 12 

W 22 


e 



1 2 

eKl + e v n + ••■] t u i2 + 

[ u 12 + e v i? + •••] + 


'12 


e l 22 


e v 12 + .. 


e ^ 22 + . . 
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The expressions for u^, u^, 1 * 22 ’ v u and V 22 are given in E ^ s * ( 5 » 5 *22)- 
(5.5.31). 

For lightly damped structures (small e), the formulas given above provide 
an efficient way to compute an approximation of the controllability grammian. 
Using a similar approach, it is possible to develop approximate formulas for the 
observability grammian (see Appendix A). The resulting expressions for W c and 
W Q may be used to determine the balanced realization. Examination of (5.5.32) 
and the corresponding formula for the observability grammian reveals that as e 
gets smaller, causing the first terms in the expansions to dominate, W c and 
w tend to become diagonal. Thus, the product (W W ) tends to become diagonal, 

U v U 

and the diagonal elements approximate the squares of the balanced singular 
values. Using the results of this chapter and Appendix A, one can write these 
approximate expressions as 

[i th balanced singular value] 



Thus when e is small, the modal coordinates themselves constitute a balanced 
realization, and the i th balanced singular value is a measure of the importance 
of the i th mode of the model. A similar result was obtained by Gregory [Grl] 
using a different approach. 
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Subsequent chapters will address the question of how many modes and which 
modes must be included in a structural model in order for the compensator to 
converge and stabilize the nominal plant. The procedure for doing this involves 
beginning with a model of finite size and gradually adding modes to the model 
until the compensator converges. The speed of convergence and the ultimate size 
of the required model are dependent in part on how good the analyst is in 
selecting the modes to be added. The approximate formulas given above allow one 
to identify which modes are likely to be the most important (the highly 
controllable and observable ones with large balanced singular values) so they can 
be added to the model first. This procedure helps keep the size of the model 
from getting too large. 

5.6 Internal Balancing in Infinite Dimensions 

The idea of internal balancing in finite dimensions can be extended to 
infinite dimensions. This extension will yield balanced states for distributed 
systems, and these states will be vectors in the abstract state spaces (Hilbert 
spaces) in which the distributed models are formulated. 

Consider the control system 
(5.6.1) X = A x + B u , y = C x 

where x is in a Hilbert space H, A generates an exponentially stable semigroup 
on H, and B and C are bounded linear operators of finite rank. In applications, 

B and C have finite rank because the number of actuators and the number of sen- 
sors are finite. The controllability and observability grammian operators 
W c and W Q are the solutions to the operator equations 
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(5.6.2) 


A*W + W A + BB* = 0 and A W. + W A* + C*C = 0 . 
c c oo 

Since B and C have finite rank, both W c and W Q are compact and selfadjoint. It 
can be shown, then, that the eigenvalue problem 


(5.6.3) 


Vc Z 1 ’ 5 i z i 


has a countable number of solutions, that the singular values s. approach zero 
as i increases, and that the eigenvectors z. are mutually orthogonal with 
respect to both W c and W Q and complete in H. Note that these properties are 
quite similar to those of the eigenvalue problem for the natural modes of a 
flexible structure. The numerical problem of computing the balanced states of 
an infinite dimensional control system is also analogous to that of computing 
the natural models of a structure. 

The results alluded to here could be developed more fully and explored with 
respect to developing efficient numerical methods for computing the balanced 
states of the distributed system. The approximation theory for computing 
balanced states will be especially important. To the authors' knowledge, work 
has not been carried out on this topic. 
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6. The Infinite Dimensional LQG Problem 


Section 6.1 of this chapter presents some preliminary definitions and 
results for the optimal linear-quadratic regulator problem on an arbitrary real 
Hilbert space E. These results are generic in the sense that E is not 
necessarily the Energy space of Chapters 3 and 4, and the operators A, B, etc., 
do not necessarily represent a flexible structure. In subsequent chapters, such 
generic results facilitate derivation of approximation theory for the infinite 
dimensional state estimator from the analogous approximation theory for the 
control problem. 

Section 6.2 discusses an infinite dimensional state estimator for the 
generic plant in Section 6.1, an infinite dimensional compensator and the corre- 
sponding closed-loop system. The purpose of Section 6.2 is to establish certain 
properties of the estimator, compensator and closed-loop system that do not 
depend on the control and estimator gains being optimal. Therefore, the gains 
in Section 6.2 are allowed to be arbitrary bounded linear operators between 
appropriate spaces, and neither the estimator nor the compensator is necessarily 
optimal . 

Section 6.3 defines the LQG-optimal infinite dimensional estimator and com- 
pensator, still for a generic plant. Finally, Section 6.4 gives some important 
implications of the general results for the case where the plant is the infinite 
dimensional flexible structure model defined in Chapter 3. 

Most of the results in this chapter are analogous to results for the finite 
dimensional LQG optimal control problem, with infinite dimensional operators 
corresponding to finite dimensional matrices. In particular, instead of the 
control and estimator gain matrices in finite dimensions, here we have gain 
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operators. The functional gains in Section 6.4 can be thought of as a generali- 
zation of the transposes of finite dimensional gain matrices, although the rela- 
tionship between functional gains and infinite dimensional gain operators is not 
so simple. 

6.1 The Generic Optimal Regulator Problem 

Let a linear operator A generate a C Q -semigroup T(t) on a real Hilbert space 
E, and suppose that B e L(R m ,E) , that Q e L(E) is nonnegative and selfadjoint 
and that R is a positive definite, symmetric real m x m matrix. The optimal 
control problem on E is to choose the control function u e L 2 (0, R m ) to mini- 
mize the performance index 

.GO 

(6.1.1) J(z(0),u) = J (<Q z(t), z(t)> E + u(t) T R u(t)) dt 

0 

where the state z(t) satisfies 

(6.1.2) z(t) = T(t) z(0) + | T(t-s)Bu(s) ds, t :> 0. 

0 

Definition 6.1. A function u e L2(0, ®; R m ) is an admissible control for the 
initial state z, or simply an admissible control for z, if J(z,u) is finite; 
i.e., if the state z(t) corresponding to the control u(t) and the initial con- 
dition z(0) = z is in L 2 (0, ®; E). 

Definition 6.2. Let the operators A, B, Q and R be as defined above. An opera- 
tor n in L(E) is a solution of the Riccati algebraic equation if n maps the 
domain of A into the domain of A* and satisfies the Riccati algebraic equation 

(6.1.3) a* n + n a - n br _1 b* n + q = o. vvv 


70 



Theorem 6.3 (Theorems 4.6 and 4.11 of [Gi4]). There exists a nonnegative 
selfadjoint solution of the Riccati algebraic equation if and only if, for each 
z e E, there is an admissible control for the initial state z. If n is the 
minimal nonnegative selfadjoint solution of (6.1.3), then the unique control 
u(-) that minimizes J(z,u) and the corresponding optimal trajectory z(*) are 
given by 

(6.1.4) u ( t) = - R _1 B*n z(t) 
and 

(6.1.5) z ( t) = S(t) z, 

where S(t) is the semigroup generated by (A - BR _1 B*II). Also, 

(6.1.6) J(z,u) = min J(z,v) = <n z, z> E . 

v 

If, for each initial state and admissible control, 

(6.1.7) lim | z(t)| =0, 
t — >* 

there exists at most one nonnegative selfadjoint solution of (6.1.3). If Q is 
coercive, (6.1.7) holds for each initial state and admissible control and S(t) 
is uniformly exponentially stable. VVV 

We will refer to T(t) as the open-loop semigroup and to S(t) as the closed- 
loop semigroup. 

6.2 An Infinite Dimensional Estimator and the Corresponding Compensator and 
Closed-Loop System 

The differential equation corresponding to (6.1.2) is, of course, 

(6.2.1) z(t) = Az(t) + Bu(t) , t > 0. 
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While (6.1.2) is valid for any u e L 2 (0, R m ) , the sense in which (6.2.1) 
holds -- when it holds -- depends on how smooth u is. Thus it is more precise 
to write (6.1.2) for arbitrary L 2 controls, but continuing to write (6.1.2) in 
this section, along with the similar integral equations for the estimator and 
closed-loop system, would be too cumbersome. Therefore, we will write the dif- 
ferential equations with the understanding that when necessary they can be 
interpreted as representing the appropriate integral equations. 

We assume that we have a p-dimensional measurement vector y(t) given by 

(6.2.2) y (t) = C Q u(t) + Cz(t) , 

where C Q e L(R m ,R p ) and C e L(E,R P ) for some positive integer p. 

Definition 6.4. For any F e L(R P ,E), the system 

(6.2.3) £(t) = Az(t) + Bu(t) + F[y(t) - C Q u(t) - Cz(t)], t > 0, 

will be called an estimator, or observer, for the system (6.2.1)-(6.2.2). Let 
S(t) be the semigroup generated by A-FC. The observer in (6.2.3) is strongly 
(uniformly exponentially) stable if S(t) is strongly (uniformly exponentially) 
stable. VVV 

To justify this definition, we write 

(6.2.4) e(t) = z(t) - z(t) 
and, with (6.2. 1)- (6.2.3) , obtain 

(6.2.5) e(t) = $(t) e(0) , t > 0. 

Of course, an estimator is necessary because the full state z(t) will not be 
available for direct feedback, and the feedback control must be based on an 
estimate such as z(t). When the desired control law has the form 
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( 6 . 2 . 6 ) 


u(t) = Fz(t) 


for some F e L(E,R m ), the observer In (6.2.3) can be used to construct z(t) from 
the measurement in (6.2.2) and then the control law in (6.2.6) can be applied to 
z(t). The control applied to the system Is then 

(6.2.7) u ( t) = Fz(t) , 


and the resulting closed-loop system satisfies 


( 6 . 2 . 8 ) 


( z(t) ) - S 

4(t) ; " • 


.(t) 


( z(°h 

( U0) ) 


t £ 0, 


where S oo- (t) is the semigroup generated on E x E by the operator 


(6.2.9) 


A -BF 
FC [A-BF-FC]] 


D(A_) = D(A) x D(A) . 


With the estimator error e(t) defined by (6.2.4), it is easy to show that 
(6.2.8) is equivalent to (6.2.5) and 


(6.2.10) Z(t) = (A-BF) z(t) + BF e(t), t > 0, 

where (A-BF) generates a semigroup S(t) on E. Also, it is easy to prove the 
following. 

Theorem 6.5. Suppose that there exist positive constants Mj, M 2 , aj and a 2 such 
that 

-a.t -a-t 

(6.2.11) |S(t)| ^ M 1 e , |S(t) | < M 2 e t > 0. 

Then, for each real a 3 < min {aj, a 2 }, there exists a constant such that 

(6.2.12) |S TO .(t)| ^ H 3 e a3t , t * 0. 
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Also, 


(6.2.13) ofA^) = o(A-BF) U o(A-FC) , 
where o( A^) is the spectrum of A^. 

The observer in (6.2.3) and the control law in (6.2.7) constitute a compen- 
sator for the control system in (6.2.1) and (6.2.2). The transfer function of 
this compensator is 

(6.2.14) «(s) = — F (si - [A-BF+F(C 0 F-C)]) _1 F, 

which is an m x p matrix function of the complex variable s. When E has infi- 
nite dimension, the compensator transfer function is irrational, except in 
degenerate, usually unimportant cases. Figure 6.1 shows the block diagram of 
the closed-loop system produced by applying this compensator to the plant in 
(6.2.1) and (6.2.2). 
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z = A z + B u 
y = C Q u + C z 

y = (C Q + C [si - A]" 1 B) u 




Infinite Dimensional Plant 


i = [A-BF+F(C q F-C)] z + F y 
u = - F z 




u = - F ( s I - [A-BF+F(C 0 F-C)]) _1 F y 


Infinite Dimensional Compensator 

Figure 6.1 Closed-loop System with Infinite Dimensional Plant 
and Infinite Dimensional Compensator 

The foregoing definitions of this section and Theorem 6.5 are straightfor- 
ward generalizations to infinite dimensions of observer-controller results in 
finite dimensions. Balas [Ba2, Ba3] and Schumacher [Sc2] have used similar 
extensions. 


6.3 The Optimal Infinite Dimensional Estimator, Compensator and Closed-Loop 
System 


Now suppose that F is chosen as 
(6.3.1) F = n C* R -1 

where n e L(E) is the minimal nonnegative selfadjoint solution to the Riccati 
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equation 


(6.3.2) a n + n a* - n c* r ' 1 n + q = o, 

with Q e L(E) nonnegative and selfadjoint and R a positive definite symmetric 
p x p matrix. Theorem 6.3 (with A, B, 0. R, n and S(t) replaced by A*, C*, Q, 

R, n and S*(t) ) gives sufficient conditions for II to exist and for the semigroup 
S*(t) -- and equivalently its adjoint, the S(t) generated by A - nC*R - *C -- to 
be uniformly exponentially stable. 

Definition 6.6. When the control gain operator is 

(6.3.3) F = R -1 B* n, 

with n the solution to the Riccati equation (6.1.3), and the estimator gain 
operator is given by (6.3.1) and (6.3.2), we call the compensator and the 
closed-loop system in Figure 6.1 the optimal infinite dimensional compensator 
and optimal closed-loop system, respectively. 

The infinite dimensional estimator defined by (6.2.3), (6.3.1) and (6.3.2) 
is the optimal estimator for the stochastic version of (6.2.1) and (6.2.2) when 
(6.2.1) is disturbed by a stationary gaussian white noise process with zero mean 
and covariance operator Q and the measurement in (6.2.2) is contaminated by 
similar noise with covariance R. For infinite dimensional stochastic estimation 
and control, see [Bal, CP2]. When the state weighting operator 0 in (6.1.1) is 
trace class, the optimal infinite dimensional compensator minimizes the time- 
average of the expected steady-state value of the integrand in (6.1.1). 

Existing theory for stochastic control of infinite dimensional systems requires 

A. 

trace-class Q, but we have a well defined compensator for any bounded non- 
negative selfadjoint Q and Q, as long as the solutions to the Riccati equations 
exist. As the next chapter shows (without assuming trace-class 0). the 1 nf 1 - 
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nlte dimensional compensator is the limit of a sequence of finite dimensional 
compensators, each of which can be interpreted as an optimal LQG compensator for 
a finite dimensional model of the structure. Therefore, we do not require 
trace-class Q in our definition of the optimal compensator, even though this 
compensator solves a precise optimization problem only when Q Is trace class. 

This text is concerned primarily with how the finite dimensional compen- 
sators in Chapter 7 converge to the infinite dimensional compensator in this 
chapter, and the analysis of this convergence requires only the theory of infi- 
nite dimensional Riccati equations for deterministic optimal control problems 
and the corresponding approximation theory. While the stochastic interpretation 
of the infinite dimensional compensator and, in Section 4 of Chapter 7, of the 
finite dimensional estimators should be motivational, nothing in the rest of the 
paper depends on a stochastic formulation. We assume that the operators Q, R, Q 

A 

and R are determined by some design criteria, either stochastic or deter- 
ministic. In many engineering applications, deterministic criteria such as the 
stability margin and robustness of the closed-loop system [Bll, TS1], rather 
than a stochastic performance index and an assumed noise model, govern the 

A A 

choice of Q, R, Q and R. 

6.4 Application to Optimal Control of Flexible Structures 

For the rest of this chapter, A Q , , A, T(t), B Q , B, Cj, C 2 and C are the 
operators defined in Chapter 3, and E = V x H is the energy space defined there. 

The next theorem concerns a typical problem in control of aerospace struc- 
tures: All elastic components have some structural damping; no damping is asso- 
ciated with the rigid-body modes but all rigid-body modes are controllable. 
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(This is the case with the structure in Section 9 of Chapter 3, for example.) 

For all rigid-body modes to be controllable, an actuator is required for each 
rigid-body mode. 

Theorem 6.7. i) Suppose that Aj = B Q Bj and that A Q = A Q + Aj and d Q = d Q + Aj 
are H-coercive, so that there exist positive constants p, y and 0 such that, for 
all v e V, 

(6.4.1) |vi 2 ;> p | v | 2 

r \r 

(6.4.2) d Q (v,v) > Y | V | 2 

H 

(6.4.3) dg(v,v) < 0 | v ) ^2 
and 

(6.4.4) max{|B 0 |,|Q|,|R|} S 0. 

(The V-continuity of d Q implies (6.4.3).) Then (6.1.3) has a minimal non- 
negative selfadjoint solution n. 
ii) Suppose also that 

(6.4.5) <Qz, z> E £ p | z | |. 

Then the optimal closed-loop semigroup satisfies 

(6.4.6) |S(t) | < M 2 e _a 2 t , t > 0, 

where M 2 and a 2 are positive constants depending on p, y and 0 only. 

Proof. See [GA2]. 
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Now we will consider the structure of the optimal control law in more 


detail. Since n e L(E) and E = V x H, we can write 


(6.4.7) 


n = 


n. 


IL 


n Q e L(V), IIj e L(H,V) , n 2 e L(H), and n Q and n 2 are nonnegative and self- 
adjoint. With z = (x,x) , as in Chapter 3, (6.1.4) becomes 


(6.4.8) u(t) = - R _ 1 Bj[n* x(t) + n 2 x(t) ]. 

Since B Q e L(R m ,H), we must have vectors b. e H, 1 < i <m, such that 

m 

(6.4.9) B q u = Y^_ b i u i 

i = l 

for 

(6.4.10) u = [Uj u 2 ... u m ] T e R m . 


Also, for h e H, 

(6.4.11) Bgh = [<bj f h> H <b 2 ,h> H ... <b m ,h> H ] T . 

Since n*x(t) and n 2 X(t) are elements of H, we see from (6.4.8) and (6.4.11) 
that the components of the optimal control have the feedback form 

(6.4.12) u.(t) = - <f r x(t)> v - <g i ,x(t)> H , i = l, .... m, 

where f. e V and g. e H are given by 
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(6.4.13a) 


m 


f, - £ (R- 1 ),^. 
J-l 


m 


1 = ly2)*(«y[Tl* 


(6.4.13b) g, = Y^_ ( R ” 1 ) 1 j n 2 b J* 

j-l 

We call f. and g i functional control gains. 


Since the measurement operator C in (6.2.Z) now has the form discussed in 

a 

Section 7 of Chapter 3, the estimator gain operator F has the form 


(6.4.14) Fy 


■ L (f r »i> 


i=l 

l T . dP 


for y = [y 1 y 2 ... y p ] e R p , where the functional estimator gains f. and g^ are 
elements of V and H, respectively. 


For the optimal estimator gains, we can partition n as 


(6.4.15) n = 


n o n i 


s ; " 2 


and use (6.3.1) and (3.7.4) to get 


(6.4.16a) 


(6.4.16b) 


?i = ^ (R 1 )ij(Vlj + ”l C 2j^’ 

J-l 

= / ^ ^ij^O c lj + ^l C 2j^ ’ 

j-l 


i = l,2,»»»,p> 


Now we partition Q as 
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(6.4.17) 



In the optimal control problem, we almost always have a nonzero Qq because this 
operator penalizes the generalized displacement. For the results in this text, 

a 

Og can be nonzero in the observer problem, and, as in the control problem, some 
of the strongest convergence results for finite dimensional approximations can 
be proved only for coercive Q. However, if the observer is to be thought of as 
an optimal filter, then Q should be the covariance operator of the noise that 
disturbs (3.1.1). In this case, Qq = 0 and Qj = 0. 
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7. Approximation for the LQG Problem 
7.1 Preliminaries 

The approximating finite dimensional LOG problems to be solved numerically 
are defined for the n th approximating control system in (4.1.2) - (4.1.8). In 
approximating the solutions to the infinite dimensional LQR problem and the 
infinite dimensional state estimation problem, we will need the following 
2n x 2n grammian matrices: 

(7.1.1) K n = [<e. , e.> v ] = K n + [<A X e., e.> H ], 

(7.1.2) W n = 

(Recall the stiffness matrix K n and the mass matrix M n from (4.1.3).) The 
superscript n on any matrix indicates the order of approximation , not a power of 
the matrix; the matrix U~ n will be the inverse of l/ 1 . 

To construct the finite dimensional compensators, we must assume the 
following: 

Hypothesis 7.1. There exist two sequences of symmetric, nonnegative 2n x 2n 
matrices Q n and Q n , n = 1, 2, ... V7V 

The matrices Q n and Q n will be used in the Riccati matrix equations to be 
solved numerically for finite dimensional control and estimator gains. Of 
course, we want these matrices to be related to approximations of the operators 
0 and Q, respectively, but we will postpone hypotheses related to operator con- 
vergence until the next chapter. For now we just will give the most common way 

~n *n 
of defining Q and Q . 


K n 0 
0 M n 
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The matrix Q n most often is defined to be the nonnegative, symmetric 2n x 2n 
matrix 

foj Qj 

(7.1.3) Q n = 

Qj T Q n 2 

whose n x n blocks are 

Q 0 = [< e i» Q 0 ej> v ] , 

(7.1.4) Qj = [<e r Qj e J > v ] , 

Q" = [<e. , Q 2 ej > H ] , 

where Q fl , Qj and Q 2 are the operators in Chapter 6, Section 4. 

The most common way of defining the matrix Q n is similar to that for Q n , 
but there is an important difference. If the operator Q in Section 6.3 is par- 
titioned as in (6.4.16) and a symmetric matrix Q n is defined as in (7.1.4) with 
Q 0 , Qj and Q 2 replaced by Q Qf Qj and Q 2 , then 

(7.1.5) Q n = W“ n Q n W" n . 

Thus defined, the matrices Q n and Q n are related to approximations of the 

operators Q and Q in the following way. We define Q„ = P c QL and Q = P c Q| c 

n t n t n n t n t n 

where P F is the projection in (4.2.16), and we define Q n and Q n to be the 
t n 

matrix representations of Q n and Q n , respectively. It follows, then, that 

(7.1.6) Q n = W n Q n 

and 

(7.1.7) Q n = Q n W" n . 

It is interesting to consider the case Q = 6 = I. The state weighting 
operator Q = I (i.e., Q Q = I, Q 1 = 0, Q 2 = I) in the infinite dimensional 
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control problem penalizes the sum of the total energy in the structure and the 
squares of any rigid-body displacements. The operator Q = I in the infinite 
dimensional estimator problem corresponds to no physical noise; nonetheless, our 

A A 

approximation theory allows Q = I, and we often have found that a coercive Q 
produces desirable stability properties in the estimator. 

For Q » Q ■ I, (7.1.6) and (7.1.7) yield Q n = W n and Q n = W" n . This may 
appear dubious. Indeed, (7.1.6) and (7.1.7) may appear to violate the normal 
duality between the LQG control and estimator problems. However, examination 
of the performance index in (7.2.1) for the finite dimensional control problems 
and the stochastic interpretation in Section 7.6 of the finite dimensional esti- 
mators should demonstrate that (7.1.6) and (7.1.7) are natural from both the 
infinite dimensional and the finite dimensional perspectives. 

While the Q n and Q n in (7. 1.3)-(7. 1.7) often are used in the finite dimen- 
sional LQG problems, the rest of this chapter assumes only Hypothesis 7.1. 

7.2 Approximation of the Optimal Control Law 
7.2.1 The Finite Dimensional LQR Problems 

The n th optimal LQR problem is; given n(0) e R 2n choose u e L 2 (0,~;R m ) to 
minimize 


(7.2.1) J n (n(0),u) = J [ n(t) T o n n(t) + u(t) T R u(t)]dt 

0 

where n ( t ) = U(t) T ,£(t) T ] T satisfies (4.1.4). We assume: 

Hypothesis 7.2. For each n > 1 and n(0) e R , there exists an admissible 
control (Definition 6.1) for (4.1.4) and (7.2.1). 
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A sufficient condition for Hypothesis 7.2 is that, for each n, the system (A n , 
B n ) be stabilizable. 


By Theorem 6.1 (or standard finite dimensional optimal control theory), the 
optimal control u n (t ) has the feedback form 


(7.2.2) u n (t) = -F n n(t) 


where 


(7.2.3) F n = R' 1 B nT ff n 

and n n is the minimal nonnegative, symmetric solution to the Riccati matrix 
equation 


(7.2.4) A nT ff 11 + n n A n - n n B n R" 1 B nT fi n + Q n = 0. 


7 .2.2 Approximating Functional Control Gains 


Now we define the 2n x m matrix 


(7.2.5) 


G n = W n F nT 


and the following elements of V 


(7.2.6) 


n 2n 

in = Y- G Ji ®j’ 9 in = Y G. 
j-1 j-n+1 J 


ji e j ’ 


i = If 2) • • • $ m. 


Then, in view of (4.1.1) and (4.1.5), the i th component of the n th control law 
in (7.2.2) is 
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(7.2.7) 


u i n ( t ) = - [i th column of G n ] W n p(t) 
n n 

= ' Y1 H G j1 <e j ’ e k > V 
j=l k«l 

2n 2n 

' YL H G ji <e j ’ e k > H ^k (t) 
j=n+l k=n+l 

- ~ i = 1, 2, •••» m. 

We call f. and g in approximating functional control gains. In Chapter 8, we 
will see how f. and g in approximate the functional control gains f. and g i in 
(6.4.12). 


7.3 Approximation of the Infinite Dimensional Estimator 
7.3.1 The Finite Dimensional Estimators 

The finite dimensional state estimator that is used on-line to approximate 
the optimal infinite dimensional estimator in Chapter 6 is 

(7.3.1) n = A n n + B n u + F (y - C Q u - C n p) 

a 2n n n 

where n(t) e R and C is the matrix in Section 1 of Chapter 4. The 2 x p gain 

matrix F n is 


(7.3.2) 


F n = fi n (C n ) T R' 1 


where n n is the minimal nonnegative, symmetric solution to 
(7.3.3) A n n n + n n (A n ) T - n n c nT r -1 c n n n + q = o 


An 


and Q is a nonnegative, symmetric 2n x 2n matrix, as in Section 7.1. We assume 
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Hypothesis 7.3. For each n £ 1, there exists a nonnegative, symmetric solution 
to (7.3.3). 

A sufficient condition for Hypothesis 7.3 is that the pair (A nT , C nT ) be 
stabi 1 izable. If (A nT , Q n ) is detectable, then (7.3.3) has at most one non- 
negative symmetric solution. 


7.3.2 The Approximating Functional Estimator Gains 
We define 


(7.3.4) 




2n 

£ 

j=n+l 


ji C J’ 


i = 1, 2, 


P. 


where F n is the estimator gain matrix in (7.3.2), and we call f- n and g- n 

A 

approximating functional estimator gains. In Chapter 8, we will see how f. and 
9 in a PP roximate the functional estimator gains f. and g. in (6.4.13) and 

A A A A 

(6.4.15). We note that f- n and g^ n are elements of V n while f. and g i are ele- 
ments of V and H, respectively. 


7.4 The Finite Dimensional Compensators and the Realizable Closed-Loop Systems 

The nth compensator consists of the control law in (7.2.2) applied to the 
output of the n th estimator in (7.3.1); i.e., the control law for the compen- 
sator is 

(7.4.1) u n = - F n rj 

where the control gain matrix F n is given by (7.2.3). The block diagram in 
Figure 7.1 shows the realizable closed-loop system that results from the n th 
compensator. We will refer to this system as the nth closed-loop system. 
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n th Compensator 

Figure 7.1. n th Closed-Loop System 
The transfer function of the n th compensator is 

(7.4.2) # n (s) = -F n (sl - [A n -B n F n +F n (C 0 F n -C n )])" 1 F n , 

which is an m x p matrix function of the complex variable s for each n, as is 
the similar transfer function *(s) In (6.2.14) for the infinite dimensional 
compensator. 

7.5 Hilbert Space Representation of the Finite Dimensional Compensators and 
Associated Rlccatl Equations 

Recall from Chapter 4 the relationships among x n (t), z„(t), £(t) and n(t). 

n n • 

(See (4.1.1) - (4.1.6) and (4.2.1).) From here on the matrices Q n and Q n will 
be defined as In (7.1.3) -(7.1.5). 

We define n n to be the operator whose matrix representation is 
(7.5.1) n n = w“ n n n 

Since the matrix II is symmetric and nonnegative, the operator n n is selfadjoint 
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and nonnegative. The Riccatl matrix equation (7.2.4) is equivalent to 


(7.5.2) W" n (A n ) T W n n n + n n A n - n n B n R” 1 (B n ) T M n n n + Q n = 0. 

Since the matrix representations of the operators A n and A* are A n and 
W‘ n (A n ) T W n , respectively, and the matrix representation of B* is (B n ) T W n , 

(7.5.2) is the matrix representation of the Riccatl operator equation 

(7.5.3) A*n n + n n A n - n B + Q n = 0. 

v ’ nn nn nn nnn 


The control law in (7.2.2) is equivalent to 


(7.5.4) 


U_ ( t) = ~ F Z (t) 

n' ' n n' ' 


with the gain operator F n given by 


(7.5.5) 


F = R _ 1 B*n . 
n n n 


Next, we define n n to be the nonnegative selfadjoint operator whose matrix 
representation is 


(7.5.6) 


n n = n n w n . 


The Riccati matrix equation (7.3.3) is equivalent to 

(7.5.7) A n n n + n n w" n (A n ) T w n - n n w" n (C n ) T R _ 1 c n n n + Q n = o. 


which is the matrix representation of the Riccati operator equation 


(7.5.8) A n n n + n a* - n n c*R - 1 cn n + 6 = o. 

v ' nn nn nn nn n 

(The matrix representation of the operator C* is W" n (C n ) T .) 

The estimator gain matrix F n is the matrix representation of the operator 


(7.5.9) F n = n n C*R _1 . 
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2n 

If we write z p (t) = n^tje. , then the state estimator in (7.3.1) Is equiva- 
lent to i=1 


(7.5.10) 


= A 


n n 


+ B„ u 
n 


+ F„ (l r 


Cq “ 


- C. 


2 n>- 


For the convergence results in Chapter 8, it is useful to have the approxi- 
mating operators that are equivalent to the various matrices in Sections 7.1 - 
7.4. The finite dimensional Riccati operator equations (7.5.3) and (7.5.8) 
approximate, respectively, the infinite dimensional Riccati operator equations 
(6.1.3) and (6.3.2). The control and estimator gain operators in (7.5.5) and 
(7.5.9) approximate, respectively, the gain operators in (6.3.3) and (6.3.1). 
Also, the state estimator in (7.5.10), which is just the Hilbert space represen- 
tation of the estimator in (7.3.1), approximates the optimal infinite dimen- 
sional estimator in Chapter 6. 


7.6 Stochastic Interpretation of the Approximating Estimators 

Our approximation of the infinite dimensional estimator is based on approxi- 
mation of the infinite dimensional Riccati equation, whose structure is the same 
for both control and estimator problems, and stochastic properties of the opti- 
mal estimator problem never enter our approximation theory. Furthermore, using 
only the deterministic setting, in Chapter 8 we analyze the finite dimensional 
estimators and the compensators based upon them. Nonetheless, we should con- 
sider momentarily the sequence of finite dimensional stochastic estimation 
problems whose solution is given by (7.3.1) - (7.3.3) or equivalently by (7.5.8) 
- (7.5.10). 
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First, recall how the covariance operator of a Hilbert space-valued random 
variable is defined. The covariance operator of an E-valued random variable w 
is the operator Q for which 

(7.6.1) expected value {<z,w>^<z,w>^} = <Qz,z>^, z, z e E. 

(See [Bal , CP2], ) 

With F n given by (7.5.9) and (7.5.8), (7.5.10) is the Kalman-Bucy filter for 
the system 

(7.6.2) z„ = A z + B u + w n , 

' ' n n n n n 

(7.6.3) y = C Q u + C n z n + w Q , 

where w n (t) is an E n ~valued white noise process with covariance operator Q n and 
w Q (t) is an Revalued white noise process with covariance operator (matrix) R. 
Next, careful inspection will show that the filter defined by (7.3.1), (7.3.2) 
and (7.3.3) is the matrix representation of the filter defined by (7.5.10), 
(7.5.9) and (7.5.8). 

With z p and n related as in (4.1.1) and (4.1.5), (7.6.2) and (7.6.3) are 
equivalent to the system 

(7.6.4) q = A n n + B n u + v , 

(7.6.5) y = C Q u + C n n + , 

where v(t) is the R -valued noise process related to w n (t) by 
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n 

(7.6.6) w n (t) = Y^_ ( v i( fc ) e i‘ v i+n (t)e i^* 
i = l 

Certainly, a Kalman-Bucy filter for (7.6.4) and (7.6.5) has the form (7.3.1) 
with the filter gain given by (7.3.2) and (7.3.3). This particular filter is 
the matrix representation of the filter defined by (7.5.10), (7.5.9) and (7.5.8) 
if and only if the matrix Q n defined by (7.1.7) is the covariance of the process 
v(t). Since Q n is the matrix representation of Q , straightforward calculation 
using (7.1.7) and (7.6.1) shows that the Q n in (7.1.7) is indeed the correct 
covariance matrix. 

Of course, if w n (t) and v(t) represent a physical disturbance to the struc- 
ture, then w n (t) must have the form (0,w^(t)) and the first n elements of v(t) 
must be zero, but this is not necessary for our analysis. 

Our finite dimensional observers can be interpreted now as a sequence of 
filters designed for the sequence of finite dimensional approximations to the 
flexible structure, with the n tfl approximate system disturbed by the noise pro- 
cess w n (t), whose covariance operator is Q n . According to Hypothesis 8.5 of 

a 

Chapter 8, these covariance operators converge to the operator Q of Section 3 of 
Chapter 6. If we have a reliable model of a stationary, zero-mean gausslan 
disturbance for the structure, then we can take the covariance operator for this 
disturbance to be Q and think of the infinite dimensional observer as the opti- 
mal estimator. But this interpretation is not necessary for the rest of our 
analysis. 
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8. Convergence Theory for the Approximating LQG Problems 

8.1 Convergence Results for Approximation of the Generic LQG Problem 

Let the Hilbert space E and the linear operators A, T(t), B, 0 and R be as 
in Section 1 of Chapter 6. Suppose that there exists a sequence of finite 
dimensional subspaces E n , with the projection of E onto E n denoted by P En , and 
there exist sequences of operators A n e L(E n ), B n e L(R m ,E n ), Q n = Q* e L(E n ), 

Hypothesis 8.1. For each z e E, 

(8.1.1) PE n z -*• z as n ■* - , 

(8.1.2) exp(A n t)P En z - T(t)z 

and 

(8.1.3) exp(A*t)P Efl z -► T*(t)z as n -► - , 

uniformly in t for t in bounded intervals; for each u e R m , 

(8.1.4) B n u ■+ Bu; 

for each z e E, 

(8.1.5) Q n P En Z - Qz. 

(In other words, P En , exp(A n t)P En> exp(A*t)P Efl , B n and Q n P En converge strongly.) 

Hypothesis 8.2. For each n, the system (A n , B n ) is stabilizable and the system 
(Q„. A n ) is detectable. 

Hypothesis 8.2 guarantees that the Riccati equation (8.1.6) in the following 
theorem has a unique nonnegative, selfadjolnt solution. 


93 



Theorem 8.3. For each n, let II p e L(E p ) be the nonnegative, selfadjoint solu- 
tion to the Riccati operator equation 


( 8 . 1 . 6 ) 


a n + n a 

n n n n 


IT B n Rn 1 B * n n 

n n n n n 



If ||n || is bounded uniformly in n, then the Riccati algebraic equation (6.1.3) 
has a nonnegative, selfadjoint solution n and n p P Ep converges weakly to II; i.e., 

(8.1.7) <n n P Ef1 z, z> E -*• <nz, z> E V z, z e E. 

If additionally there exist positive constants M and b, independent of n, such 
that 


(8.1.8) l|exp([A n -B n R' 1 B*n n ]t)|| < M e _bt , t * 0, 

then n n P En and exp([A n -B n R _1 B*II n ]t)P En converge strongly to n and S(t), 
respectively; i.e., 

(8.1.9) n p P Ep z -► Pz V z e E 

and 

(8.1.10) exp([A n -B n R _1 B*n n ]t)P En z - S(t)z V z e E, 

uniformly in t £ 0, where S(t) is the semigroup generated by A-BR _1 B*P. If 

there exists a positive constant 6, independent of n, such that 

(8.1.11) Q n * S, 

then ||P n l| being bounded uniformly in n guarantees the existence of positive 
constants M and b for which (8.1.8) holds for all n. 
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Proof. The theorem follows from Theorem 5.3 of [Gi4] and Theorem 6.7 of [Gi3], 
whose proof is valid under the hypotheses here. In [Gil, Section 4] the opera- 
tors A n> Q n and II n are extended to all of E by defining them appropriately on 
E n . Banks and Kunisch [BK1] have modified Theorem 5.3 of [Gil] to obtain 
essentially the present theorem without using the artificial, and rather clumsy, 
extensions of E n in the proof. 

Theorem 8.4. The strong convergence in (8.1.9) implies uniform norm convergence 
of the optimal feedback laws: 

(8.1.12) ||B* n n P En - B* n|| - 0 as n - 


Proof. This follows from the selfadjointness of II n and P£ n and the finite 
dimensionality of the control space R m . See equations (4.23) and (4.24) of [Gil]. 

Nonconvergence results for the case where no open-loop damping is modeled 
are given in [Gil, Gi 2 , GA2J. 

For each n, the finite dimensional operator Riccati equation (8.1.6) is an 
approximation to the finite dimensional Riccati equation (6.1.3), but the solu- 
tion to (8.1.6) also provides the solution to the optimal control problem 
defined by 


(8.1.13) z„ = A z + Bu 

n n n n 


and 


(8.1.14) J n = j (<Q n z n , z n > E + u T Ru)dt. 


The optimal control law for this problem is 
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(8.1.15) 


where the operator F p e L(E n , R m ) 

(8.1.16) F = R -1 B* n n P En . 

v ' n n n t n 

P A 

Hypothesis 8.5. There exist sequences C e L(E ,R ) and Q e l(E ) with 

n ii n ' n ^ 

Q n = 0* ^ 0 such that 

(8.1.17) C n P En C strongly 

and 

(8.1.18) ^n P En ® strongly as n •. 

As with B n , (8.1.17) implies that C n P Efl and C* converge in norm to C and C*, 
respectively. 

Hypothesis 8.6. For each n, the system (A*. C* ) is stabi 1 izable. (In par- 
ticular, any unstable modes of the system ( A n * c n ) are observable.) Also, the 
system (A*, Q n ) is detectable. 

We define the output of the system in (8.1.13) by 

(8.1.19) y p = C 0 u + C n V 
The n th state estimator is 

(8.1.20) z = A z + B u + F (y - C n u - C n z n ) 

' ' n n n n n w n 0 n n' 

where the estimator gain is the operator 

(8.1.21) F_ = C* R" 1 

' n n n 
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and n n is the nonnegative, selfadjoint solution to the Riccati operator equation 

( 8 . 1 . 22 ) a n„ + n a* - fi n c* r _ 1 c n n„ + 6 = o. 

nnnn nn nn n 

Hypothesis 8.2 implies that such a solution exists and is unique. 

As n -, the estimator defined by (8.1.20) - (8.1.22) converges to the 
infinite dimensional estimator in Chapter 6 as indicated by the next two 
theorems, which follow immediately from Theorems 8.3 and 8.4. 

Theorem 8.7. Theorem 8.3 holds with A, B, Q, II, S(t), A n , B n> Q p and 
n n replaced, respectively, by A*, C*, 6, n, S*(t), A*, C*, Q n and n N . 

Theorem 8.8. If n n PE n converges strongly to n, then 

(8.1.23) l|n n C* - n C*|| - 0 as n ■» -. 

8.2 Convergence of the Closed-Loop Systems 

Now we will consider the sense in which the n th closed-loop system in Figure 
7.1 approximates the optimal closed-loop system in Chapter 6. Recall from 
Chapter 4 how the approximating open-loop semigroups T n (*) and their adjoints 
converge strongly and how the input operators B n> the measurement operators 
C n and their respective adjoints converge in norm. Section 8.1 has given suf- 
ficient conditions for the approximating control and estimator gains to converge 
to the gains for the optimal infinite dimensional compensator; i.e., (8.1.12) 
and (8.1.23) imply 

(8.2.1) ||F n P En - F || •* 0, 

(8.2.2) |F n - F|| + 0. 
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By identifying n with z n as in Section 5 of Chapter 7, we can identify the 
closed-loop system in Figure 7.1 with a closed-loop system on the space E x E n< 
We denote the corresponding closed-loop semigroup and its generator by S a>n (t) 
and A^, respectively. Recall S^t) and from Chapter 6. 

Proofs of the remaining theorems in this chapter are given in [GA2]. Since 
these proofs are rather technical and offer no particular insight into the 
compensator-design process, we do not repeat them here. 

Theorem 8.9. For t £ 0, S (t)P EE converges strongly to S^t), and the con- 
vergence is uniform in t for t in bounded intervals. 

We should expect at least Theorem 8.9, but we need more. We should require, 
for example, that if S(t) is uniformly exponentially stable, then S oon ( t) must 
be also for n sufficiently large. Although numerical results for numerous 
examples with various kinds of damping and approximations suggest that this is 
usually true, we have been unable to prove it in general. We do have the result 
for the following important case. 

Theorem 8.10. 

i) Suppose that the basis vectors of the approximation scheme are the natural 
modes of undamped free vibration and that the structural damping does not 
couple the modes. Then S a>n (t) converges in norm to S^t), uniformly in 
bounded t-intervals. 

ii) If, additionally, S ooaB (t) is uniformly exponentially stable, then S oon (t) 
is uniformly exponentially stable for n sufficiently large. 
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This paper emphasizes using the convergence of the approximating control and 
estimator gain operators F n and F , and the convergence of the functional gains 
that can be used to represent these operators, to determine the finite dimen- 
sional compensator that will produce essentially optimal closed-loop 
performance. However, close examination of - A (see [GA2]) shows that, 

for S „(t) to converge to S (t), we need the following differences to converge 
•n ** 

to zero: 

< 8 - 2 - 3 > Wen - BF ■ 8 „( F n P En ' F > + ( 8 „ ' 8 > F ’ 

< 8 - 2 - 4 > F n C „ p En ' FC = < F n ' F > c n P En + F < c n P En ' c >' 

The second term on the right hand side of each of these equations represents, 
respectively, control and observation spillover, which has been studied exten- 
sively by Balas [Ba2, Ba3]. Together, the control spillover and observation 
spillover couple the modes modeled in the compensator with the modes not 
modeled in the compensator. The spillover must go to zero -- as it does when 
B„ and C converge -- for A - A „ to go to zero. 

We should ask, then, whether there exists a correlation between the con- 
vergence of F n and F n and the elimination of spillover. The answer is yes if no 
modes lie in the null space of the state weighting operator Q in the performance 
index and if the assumed process noise, whose covariance operator is Q, excites 
all modes, but this correlation is difficult to quantify. The two main factors 
that determine the convergence rates of the gains are the Q-to-R ratio and the 
damping, neither of which affects the convergence of B n and C n . On the other 
hand, when either factor (small Q/R or large damping) causes the gains to con- 
verge fast, it generally also causes the magnitude of F and F to be relatively 
small, thereby reducing the magnitude of the spillover terms in (8.2.3) and 
(8.2.4). Also, as n Increases, the increasing frequencies of the truncated 
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modes usually reduce the coupling effect of spillover. This Is well known, 
although it cannot be seen from the equations here. In examples that we have 
worked, we have found that when n is large enough to produce convergence of the 
control and estimator gains, the effect of any remaining spillover is negli- 
gible. But this may not always be true, and spillover should be remembered. 

8.3 Convergence of the Compensator Transfer Functions 

The transfer function of the n th compensator (shown in the bottom block of 
Figure 7.1) is given by (7.4.2) and the transfer function of the infinite dimen- 
sional compensator is given by (6.2.14). Each transfer function is an m x p 
matrix function of the complex variable s. We will denote the resolvent set of 
[A - BF + F(CqF-C) ] by p([A - BF + F(C Q F-C) ]). 

Theorem 8.11. There exists a real number a 1 such that, if Re ( s ) > a^ then s e 
p([A n -B n F n +F n (C Q F n -C n )]) for all n, and * n (s) converges to 4(s), uniformly in 
compact subsets of such s. 

This result leaves much to be desired. For example, it does not guarantee 
that any subset of the imaginary axis will lie in p([A n -B n F n + P n (C 0 F n -C n ) ]) for 
sufficiently large n, even if all of the imaginary axis lies in p([a-BF + 

A. 

F(CqF-C)]). As with the convergence of the closed-loop systems, we can get more 
for certain important cases. 
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Remark 8.12. If the open-loop semigroup T(*) (whose generator is A) is an 
analytic semigroup, then there exist real numbers a, 0 and M, with 0 and M posi- 
tive, such that p([A-BF + F (C Q F-C) ]) contains the sector {s: |arg(s-a) | < ^ + 0}, 
and for each s in this sector, 

(8.3.1) ||(sl - [A-BF + F(C 0 F-C) ]) -1 || £ M/|s-a|. 


Theorem 8.13. i) If the basis vectors of the approximation scheme are the 
natural modes of undamped free vibration and the structural damping does not 
couple the modes, then each s in p([ A-BF + F(C Q F-C)]) is in p([A n -B n F n + 

F n (CQF n -C n ) ]) for n sufficiently large and ® n (s) converges to 4(s) as n ■* », 
uniformly in compact subsets of p([A-BF + F(C Q F-C) ]) . ii) If, additionally, 
T( • ) is an analytic semigroup, then ® n (s) converges to 4(s) uniformly in the 
sector described in Remark 8.12. 

Theorem 8.14. If A has compact resolvent, then ® n (s) converges to $(s) for 
each s e p([A-BF + F (C Q F-C) ]) , uniformly in compact subsets. 
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9. Application: Compensator Design for a Compound Structure 


9.1 The LOG Problem for the Distributed Model of the Structure 

Here we pose an infinite dimensional LQG problem for the hub-beam-tip mass 
structure in Section 9 of Chapter 3 and Section 3 of Chapter 4. In the infinite 
dimensional optimal control problem, we take Q = I in the performance index in 

(6.1.1) . This means that the state weighting term <Qz,z> E is twice the total 
energy in the structure plus the square of the rigid-body rotation. Since there 
is one input, the control weighting R is a scalar. 

According to (6.4.12), the optimal control has the full-state feedback form 

(9.1.1) u(t) = - <f,x(t)> y - <g,x(t)> H 
where x(t) has the form (3.9.1) and 

(9.1.2) f * (a f ,<K f ,B f ) e V , g - (a g ,<t> g ,0 g ) £ H. 

In (9.1.2) a f , 0 f , a g and 0 g are scalars and <j> f and <t> g are functions defined 
over the length of the beam. Note that = <|> f (il) is not used in the control 
law -- recall (3.9.8) and (3.9.9). 


The single sensor measures the rigid-body angle 9, and we assume that this 

-4 

measurement has zero-mean Gaussian white noise with variance R = 10 . Also, we 

model a disturbance on the right side of (3.1.1) that is a zero-mean Gaussian 
white noise process distributed uniformly over the beam and having concentrated 
components acting on the hub and tip mass. For this disturbance, the covariance 
operator Q in (6.3.2) is 


(9.1.3) 



0 

I 


V x H > V x H. 
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According to (6.4.14), with p = 1, the gain operator F for the infinite 
dimensional estimator has the form 

(9.1.4) Fy = (f ,g)y, 

A A 

and the functional estimator gains f and g have the form (9.1.2). 

9.2 Approximation of the Optimal Control Gains 

Now we solve the finite dimensional LQR problem in Chapter 7 for increasing 
approximation orders n. For each n, the solution to the finite dimensional 
Riccati equation (7.2.4) yields the matrix F n in (7.2.3). We use this control 
gain matrix to compute the approximating functional control gains in (7.2.6) and 
(7.2.7) and to compute the feedback law (7.4.1) for the finite dimensional com- 
pensator in Figure 7.1. Recall that n = 2n + 1 where n is the number of ele- 

C C 

ments used to approximate the beam. 

For the numerical solution to the n th approximating optimal control 
problem, we begin with the n x n matrices K n , D n and Bg in (4.3.2) and the 
n x n mass matrix M n computed with the inner product in (3.9.3) for the basis 
vectors in (4.3.1). With these matrices, we form the 2n x 2n matrices A n and 
B n in (4.1.6) and the n x n matrix K n and the 2n x 2n matrix W n in (7.1.1) and 
(7.1.2); K n is K n with 1 added to the first element. Since the operator 0 in 
the infinite dimensional control problem is the identity, the 2n x 2n matrix 
Q n in (7.1.3) and (7.1.6) is equal to W n . With these matrices, we solve (7.2.4) 
numerically by the standard eigenvector decomposition of the Hamiltonian matrix 
[KS1], often called the Potter method. 

The approximating functional control gains, computed with (7.2.6) with 
m * 1, have the form 
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(9.2.1) 


f n = (a fn’ *fn’ 0 fn^ ’ 9 n = (a gn’ ^gn’ 0 gn^ 

As n increases, these functional gains should converge to the functional gains 
in (9.1.2), which are optimal for the infinite dimensional control problem. 

-4 

For the damping coefficient Cg = 10 and the control weighting R = .05, 
Figures 9.2.1 and 9.2.2 show the functional gain kernels 4*^ and 4> gn computed 
with cubic Hermite splines and n g = 4, 6, 8 and 10 beam elements. Table 
9.2.1 lists the corresponding scalar components of the gains. We have plotted 
(Ji^ because the second derivative appears in the strain-energy inner product and 

r\ 

because 4> fn converges in H^(0,4) , so that <t>^ converges in l_ 2 . The numerical 

results indicate that 4)^ and 4> gn converge uniformly on [0,4], even though the 

Hermite splines yield a 4>^ that is discontinuous at the nodes. We have seen 

this convergence for the functional gains corresponding to beams in all of our 

structure-control examples, and we suspect that it is the case generally, 

although the convergence theorems in [GA2] guarantee only L 2 convergence for 

* ' ' and 4>„„. Since 8.., is not used in the n th control law, we omit it from the 
fn gn fn 

tables. 


"e 

a fn 

a gn 

0 gn 

4 

4.4721 

1.2440 

-133.87 

6 

4.4721 

1.2973 

-139.69 

8 

4.4721 

1.3106 

-141.15 

10 

4.4721 

1.3141 

-141.54 


Table 9.2.1. Scalar Components of Functional Control Gains 

-4 

Damping coefficient c Q = 10 ; Control weighting R = .05 

Number of elements n = 4, 6, 8, 10 

C 

Hermite Spline Approximation 
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9.3 Approximation of the Optimal Estimator Gains 

~ n t h 

To compute the gain matrix F for the n approximating estimator, we solve 
the Riccati equation (7.3.3), whose solution yields F n according to (7.3.2). 

The 1 x 2n matrix C n is given by (4.3.2), and 2n x 2n matrix Q n is 

(9.3.1) Q n 

according to (9.1.3) and (7.1.5). (As always, M _n is the inverse of the mass 
matrix. ) 

We compute the approximating functional estimator gains according to 
(7.3.4). Like the functional control gains, the functional estimator gains have 
the form 

(9.3.2) f„ = (a tn , * fn . (S fn ) , g n = (a fn , B fn ). 

As in the control problem, our convergence theory establishes only 
L 2 ~convergence for ({i^ and 4>g n , but the numerical results show uniform conver- 
gence on [0,£]. Since <J> fn (0) * <t>f n (0) = the convergence of (J)^ implies the 
convergence of 0^ n = <J>j. n (fi.). Thus, 0^ n is not an independent piece of informa- 
tion about the estimator gains while, as far as our convergence results go, B gn 
is. We maintain analogy with the control problem and list only 0 gn in the sub- 
sequent tables. Figures 9.3.1 and 9.3.2 show ^ and <J> , and Table 9.3.1 lists 
the scalars a^, a gn and 0 gn . 


0 0 
0 M" n ’ 
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n e 

a fn 

Cl 

gn 

e gn 

4 

5.3195 

14.149 

-1495.7 

6 

5.3567 

14.347 

-1517.5 

8 

5.3611 

14.371 

-1520.1 

10 

5.3623 

14.377 

-1520.8 


Table 9.3.1. Scalar components of Functional Estimator Gains 
Damping coefficient c Q = 10“ 4 ; 

Estimator R = 10~ 4 

Number of elements n g = 4, 6, 8, 10 

Hermite Spline Approximation 
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Hermite Spline Approximation Hermite Spline Approximation 






9.4 The Finite Dimensional Compensators 


Now we construct the finite dimensional compensator in Figure 7.1 for 4, 

6, 8 and 10 beam elements. Recall that (7.4.2) gives the transfer function of 
the n th compensator. Figure 9.4.1 shows the frequency responses (bode plots) of 
these compensators. The first graph in each figure contains the gain 
(magnitude) plots for all four orders of approximation, while the phase plot 
represents only the highest-order approximation. The finite dimensional compen- 
sators converge to the infinite dimensional compensator as n g increases. This 
is consistent with the convergence of the functional control and estimator 
gains. 
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9.5 The Closed-Loop Elgenstructure 


As (6.2.13) says, the spectrum of the optimal closed-loop system is the 

union of the spectrum of (A-BF) and the spectrum of (A-FC). If the damping 

operator D Q in (3.1.1) is bounded relative to Ajj for some y < 1 , then A has 

compact resolvent. In this case, (A-BF) and (A-FC) also have compact resolvent 

because BF and FC are bounded, so that the closed-loop spectrum consists only 

of isolated eigenvalues, each with finite multiplicity. If, as in our example, 

D 0 = CqAq with Cg > 0, A does not have compact resolvent; if the eigenvalues 
2 

of Ag are w^, o(A) contains at most a finite number of complex eigenvalues and 

2 2 

CgWj - 4) Wj/2, which approach -» and the continuous 
spectrum { - 1 /c Q } as j increases. When D Q = CgAg for any 0 < y < 1, the eigen- 
vectors of A are the same as those when Dg = 0, and hence are complete in E. 

For the remainder of this discussion, let us assume that, as in our 
example, the spectrum of A ( i - e. , the open-loop spectrum) consists of isolated 
eigenvalues with finite multiplicity and possibly a finite number of limit 
points, and that the eigenvectors are complete in E. (This is the case in our 
example.) Then, since BF and FC are bounded with finite rank, it follows from 
applying standard perturbation results [Ka2, pp. 208-214] that the asymptotic 
properties of both o(A-BF) and o(A-FC) are identical to those of o(A). This 
means that, beyond some number of eigenvalues, the eigenvalues of A-BF are vir- 
tually identical to those of A because the optimal compensator essentially 
controls only a finite number of modes. It also means that, although the opti- 
mal estimator contains copies of all the modes of the structure, essentially it 

observes only a finite number of modes and feeds virtually no sensor data into 

the copies of the rest. Of course, the infinite number of inactive estimator 

modes should be truncated before implementation. 


Ill 


We have not proved as much as we would like to about how the 
closed-loop spectrum -- i.e., the spectrum of the operator A discussed in 
Section 2 of Chapter 8 -- converges to the optimal closed-loop spectrum. In 
[GA2, Section 9], we showed that an extension of A converges in norm to 
A ^ (the optimal closed-loop generator in (6.2.9)) when the damping operator 
D q does not couple the modes of free vibration and the natural mode shapes are 
the basis vectors for the approximation scheme. We also showed that, when A has 
compact resolvent, (si - A^) -1 converges in norm to (si - A^) - *. In either of 
these cases, it follows from the section of Kato cited above that the eigen- 
values of A converge to those of A^. Our numerical results indicate that 
the eigenvalues of the n th closed-loop system based on the Hermite spline 
approximation also converge to those of the optimal closed-loop system, but we 
have not proved this. 

To compute the closed-loop eigenvalues, we approximated the flexible struc- 
ture (the plant) with 15 natural modes (obtained using 30 beam elements with 
Hermite splines) and formed the closed-loop system with the n th compensator for 
various values of n. The numerical results indicate that the optimal compen- 
sator ignores and does not affect any mode past the ninth (the eight flexible 
mode), and we are stretching matters to say that the ninth mode is controlled. 

Table 9.5.1 lists the eigenvalues for the closed-loop system consisting of 
the 15-mode model of the structure and the compensator based on 10 elements with 
Hermite splines. For this compensator, n = 2*10 + 1 = 21, so that the estimator 
contains approximations to the first 21 modes of the structure. Most of these 
approximations have not converged to the true natural modes, as can be seen from 
the imaginary parts of the eigenvalues under A-FC, which converge from above to 
slight perturbations of the open-loop frequencies. 
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The last twelve of the twenty-one modes in the 10-element Hermite spline 
compensator are inactive as far as the input/output map of the compensator. 

They do not couple with the rest of the compensator or with any modes in the 
structure, and they could be truncated without affecting the closed-loop 
response. The eigenvalues of the inactive compensator modes are equal to the 
corresponding open-loop eigenvalues of the ten-element approximation to the 
structure. The magnitudes of the real parts are larger than those for the 
corresponding true open-loop eigenvalues because the damping is proportional to 
the stiffness and the frequencies of the higher modes of the approximation are 
greater than the true frequencies. Since the inactive compensator modes reside 
in the n estimator, we list the corresponding eigenvalues under (A-FC). 

The closed-loop eigenvalues in Table 9.5.1 corresponding to modes 5, 6 and 
7 are not equal to the corresponding eigenvalues of ( A 2r B 21 F 21^ and 
(A2l"^21 C 2l) ‘ ( For each modes 5 ‘ B and 7 » we ^ sted one P air closed-loop 

a 

eigenvalues under (A-BF) and one pair under (A-FC).) This means that the 
Hermite-spline compensator for ten elements is not as close to convergence as 
the preceding functional gain and bode plots might suggest. Still, the closed- 
loop response produced by the ten-element Hermite-spline compensator should be 
close to optimal, except possibly for the sixth mode. Table 9.5.1 shows that 
this closed-loop system is stable. 
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Table 9.5.1. Closed-loop Eigenvalues with Compensator 
Based on Hermite Splines for 10 Elements 


OPEN- 

-LOOP 

A- 

BF 

A- 

FC 

REAL 

I MAG 

REAL 

I MAG 

REAL 

I MAG 

0.0 

0.0 

-0.03305 

0.0 

-0.27579 

0.33295 

0.0 

0.0 

-0.00799 

0.0 

-0.27579 

-0.33295 

-0.00005 

0.96522 

-0.11498 

0.96522 

-1.07059 

1.52394 

-0.00052 

3.21201 

-0.17909 

3.21192 

-0.83359 

3.27848 

-0.00242 

6.96055 

-0.15092 

6.95926 

-0.33913 

6.97026 

-0.00840 

12.96107 

-0.09170 

12.94415 

-0.14130 

12.98792 

-0.02280 

21.35301 

-0.03957 

21.33965 

-0.09902 

21.40999 

-0.05149 

32.08958 

-0.05132 

32.08852 

-0.08468 

32.23854 

- 0.10190 

45.14356 

- 0.10176 

45.14358 

- 0.11477 

45.54628 

-0.18305 

60.50577 

-0.18300 

60.50578 

-0.19281 

61.43464 

-0.30559 

78.17711 

-0.30557 

78.17711 

-0.32191 

80.02459 

-0.48184 

98.16564 

-0.48183 

98.16564 

-0.51073 

100.99211 

-0.72587 

120.48589 

-0.72586 

120.48589 

-0.87879 

132.55647 

-1.05361 

145.15908 

-1.05361 

145.15908 

-1.28294 

160.16850 

-1.48299 

172.21366 

-1.48299 

172.21366 

-1.89848 

194.84366 

-2.03407 

201.68604 

-2.03407 

201.68604 

-2.78066 

235.80614 





-4.03103 

283.90815 





-5.78013 

339.95430 





-8.15257 

403.71367 
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10. Other Applications Issues and Future Research Topics 


The methods developed or applied in this monograph provide a mathematically 
sound yet practical framework for designing feedback control systems for dis- 
tributed parameter structural systems. As in most research, development of the 
ideas presented here raises new questions as it answers current ones. In this 
chapter we mention some of the issues which have emerged during the course of 
this work. Some of these we have pursued to the extent of achieving promising 
but as yet incomplete results. Others we note as issues for future investiga- 
tion. The ideas will fall into three areas: 1) discrete time modeling and 

control, 2) robustness, and 3) model reduction. 

10. 1 Discrete Time Modeling and Control 

Recent research [GR1, GR2, GR3] has developed approximation methods and con- 
vergence results for discrete-time LQG control problems for distributed systems. 
Much of the discrete-time theory is analogous to that in this monograph for the 
continuous-time problem. The infinite dimensional discrete-time theory and 
approximation methods have been applied to control of flexible structures in 
[GR1, GR2]. Further research is needed on numerical difficulties encountered in 
approximate solution of infinite dimensional discrete-time Riccati operator 
equations. One respect in which the discrete-time problem differs from the 
continuous-time problem is that boundary control for flexible structures usually 
results in a bounded input operator for the discrete-time problem, whereas the 
input operators corresponding to boundary controls in continuous-time problems 
usually are unbounded. This is the subject of [GR2, GR3]. Further work is 
needed along these lines to expand the class of boundary operators and the class 
of unbounded measurement operators to which the discrete-time approximation 
theory applies. 
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Also, discrete-time input/output representation of infinite dimensional 
models of flexible structures has been investigated in [Jal, GJ1]. Infinite 
dimensional auto-regressive-moving average (ARMA) models of both the plant and 
the compensator have been developed for simple structures. Preliminary approxi- 
mation theory for such models has been developed, but considerably more work is 
needed to bring this point of view to the level of maturity and usefulness of 
the state-space methods for distributed systems. Such research should be par- 
ticularly important for adaptive identification and control of flexible struc- 
tures, as suggested by the large orders of finite dimensional ARMA models found 
necessary in [JG1, JG2] for adaptive identification of an experimental flexible 
structure at NASA Langley Research Center. 

10,2 Robustness 

An important feature of the approach developed in this monograph is that the 
order and modal composition of the model are automatically adjusted as a function 
of the performance objectives, disturbance environment, sensor locations and 
actuator locations. This is an effective way to deal with problems due to model 
truncation. However, there is nothing in the approach that explicitly addresses 
the issue of modeling errors due to poorly known parameter values (e.g., fre- 
quencies, damping constants, mode shapes, etc.). There are always some dif- 
ferences between a model and the physical system the model is supposed to 
describe. Since control system designs are based on models, one must always be 
concerned about the degree to which the theoretical performance is degraded due 
to differences between the model and the physical system. Hence, modeling 
errors of all types are a major concern in designing control systems. 

In addressing the modeling error problem, it is important to describe the 
modeling errors in a way that facilitates dealing with them. One approach is 
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to use the structured uncertainty description of Doyle [Dol]. Structured uncer- 
tainty models represent specific modeling errors (frequency errors, damping 
errors, mode shape errors, etc.) in the form of auxiliary multivariable feedback 
loops appended to the plant model. These descriptions may be used as the 
starting point for different approaches to robust controller design, e.g., 
Doyle's p synthesis approach [Dol] or the Linear Quadratic Gaussian (LQG) 
approach which is the focus of this monograph. In the context of the LQG 
approach, the forms of the auxiliary feedback loops motivate changes in the per- 
formance weighting matrix in the LQR problem and the disturbance noise 
covariance matrix in the K-B filter problem. These changes involve adding 
weighting matrices which reflect the specific nature of the uncertainty [BM1, 
TS1]. Making the additional terms large or small permits a controlled trade-off 
between performance and robustness. Thus one can arrive at a suitable balance 
between achieving a desired performance goal and minimizing the effect of struc- 
tured uncertainties in the plant model. 

Some results of using this approach are described in [BM1] and they show a 
marked increase in robustness with only a modest loss in performance. The 
robustness is not achieved by rolling off the loop gain before the first uncer- 
tain modes. Several of the uncertain modes are actively controlled. Numerical 
results in [BM1] focus on uncertain frequencies, but the methods used can be 
extended to other types of parameter errors. Work to study the nature and 
effectiveness of these extensions is currently in progress. 

Another approach for improving robustness involves using nonlinear 
programming to achieve robustness by sensitivity optimization, which means mini- 
mizing the sensitivity of closed-loop eigenvalues with respect to uncertainties 
in plant parameters. The method can also be used to optimize the shape of 
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structural elements to reduce structural weight. An overall control /structure 
optimal design can be achieved by using both of these features simultaneously. 
Appropriate constraints are imposed to ensure that high performance is main- 
tained. This approach to robustness is developed in papers [Adi, AG1, AG2]. 

The idea is to use nonlinear programming to reduce the sensitivity of the 
closed-loop eigenvalues with respect to modeling errors, while maintaining suf- 
ficiently high performance of the closed-loop control system. 

The paper [AG1] derives formulas for the sensitivities of closed-loop eigen- 
values with respect to uncertain plant parameters and presents a numerical 
example that demonstrates the effect of these sensitivities on robustness in 
control of a flexible structure. The analysis in [AG1] indicates that the 
first-order sensitivities of the closed-loop eigenvalues approach infinity as a 
controller and an estimator eigenvalue approach each other and suggests that 
robustness can be improved by separating controller and estimator eigenvalues. 
The numerical results for the flexible structure example in [AG1, AG2] 
demonstrate the improved robustness achieved by moving the estimator eigenvalues 
to the left of the controller eigenvalues. 

The work in [Adi, AG1, AG2] has led to a general guideline for choosing the 
state weighting matrix and the process noise covariance (the 0 matrices) in the 
LQG problem to improve robustness: After the state weighting for the control 

problem is chosen according to performance criteria, the Q matrix for the esti- 
mator design is chosen to move the estimator eigenvalues for the controlled 
modes with higher frequencies sufficiently to the left of the closed-loop 
controller eigenvalues to reduce the closed-loop eigenvalue sensitivities to 
acceptable levels. Examples of Q matrices that achieve this sensitivity reduc- 
tion are given in [Adi, AG1-AG2], In general, the estimator Q for a modal 
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representation of a structure is diagonal and its diagonal elements increase as 
the corresponding structural frequencies increase. 

References [Adi, AG2] discuss optimal eigenvalue sensitivity reduction in 
conjunction with optimal weight reduction by structural shape optimization. The 
idea is to combine minimization of closed-loop eigenvalue sensitivity with opti- 
mization of structural mass distribution, subject to constraints on eigenvalue 
location, to produce a robust controller, a light structure and a closed-loop 
system with fast response. While the measure of robustness used in the design 
objective Is the first-order sensitivity of the closed-loop eigenvalues, the 
final evaluation of the robustness of the design is based on large variations in 
the uncertain parameters. The numerical results in [Adi, AG1, AG2] demonstrate 
the effectiveness of the method for producing both a robust control system and a 
light structure. 

Recently, a significant body of literature has been devoted to Liapunov sta- 
bility methods for robustness analysis and design of control systems with struc- 
tured uncertainties [HB1, KB1, Yel, YL1 , ZK1]. These papers use quadratic 
Liapunov functions to estimate a region in parameter space for which a control 
system will remain stable. Such Liapunov robustness analysis is combined in 
[Bel, GB1, PHI] with nonstandard Riccati matrix equations for robust controller 
design. 

The main advantage of these Liapunov methods is that they can be applied in 
a straightforward and computationally efficient manner to complex, realistic 
systems. However, the methods generally yield conservative robustness estimates 
because, with the possible exception of the method in [HB1], all of the Liapunov 
robustness methods in current literature are zero-order in the sense that a 


119 



single Liapunov function is used for all permissible variations in the uncertain 
parameters. 

Recent research at UCLA [LG1] has produced a first-order Liapunov robustness 
analysis approach, which allows the Liapunov function to vary with the uncertain 
plant parameters. On most problems the robustness margins predicted by this new 
method are substantially larger than those predicted by the zero-order methods. 
If the first-order Liapunov robustness analysis continues to be as successful as 
it has been in early applications, an important line of research should be to 
develop related first-order robustness design methods motivated by the 
Riccati/Liapunov methods in [Bel, PHI]. Because of the theory and approximation 
methods presented in this text for infinite dimensional Riccati equations and 
the success of the applications to optimal controller design for large space 
structures, it seems reasonable to expect that this kind of theory and the asso- 
ciated approximation methods could be used to develop Riccati/Liapunov robust 
controller design methods for large space structures. 

10.3 Model Reduction 

The model reduction work described in this monograph deals with systems 
described by linear constant coefficient differential equations. This is a 
feature that is shared with most of the model reduction work reported in the 
literature. Equations of this type arise when one is studying small vibrations 
about a state of rest in inertial space and this is a situation of great 
interest for applications. Constant coefficient equations may also arise for 
some systems performing small vibrations about a state of steady motion (e.g., 
steady spin). In many cases of practical interest, however, the nominal motion 
either is periodic or consists of a single maneuver over a finite time interval. 
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If special inertial symmetry conditions are not satisfied, the linearized 
equations describing small vibrations about periodic motions generally have 
periodic coefficients, while a general finite time maneuver often leads to 
linear equations with time varying coefficients of a general form. A Dual-Spin 
Spacecraft where both bodies are flexible and neither body is inertial ly sym- 
metric is an example of a system where small variations about steady spin lead 
to linear equations with periodic coefficients. During steady state operation, 
the equations governing transverse attitude motion of such a spacecraft assume 
the form: 

(10.3.1) X = A(t)X ; X(0) = XO 

(10.3.2) A(t+T) = A ( t ) 

where T = 2 tt/w.. and w , is the relative rate between the two bodies, 
rel rel 

If one wishes to use the methods of this monograph to design a control system 
for such a spacecraft, it would be desirable to have a scheme similar to that 
described in Ch. 5 for obtaining lower order models of the system. 

Of the various methods available for model reduction, balanced realization 
theory is perhaps the best suited for extension to the time varying case. The 
application of balanced realization theory for time varying systems has been 
considered in the literature [SSI, VK1], but work in this area has not been 
extensive. Examples worked out so far have been simple and nonphysical. (In 
particular, these examples were not from the field of structural dynamics.) The 
potential of this approach for time varying structural systems is largely 
untapped, and it would make a suitable topic for further investigation. 
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APPENDIX A. Asymptotic Expansion for the Observability Grammlan 
The observability grammian satisfies the equation 

(A. 1) W q A + A T W Q + C T C = 0 

For the lightly damped mechanical systems of Section 5.5, 

0 I 

(A.2) A = , C = [C p C v ] 

-O 2 -eA 

The development of an expansion for W £ in Section (5.5) is simplified somewhat 
because the matrix BB T is zero except for the lower right hand quadrant. In 
contrast, the matrix CC T is generally full, so the development of an expansion 
for W Q is more complicated. For this reason, we will develop an expression only 
for the first two terms in this expansion. When CC T is full, there is no par- 
ticular advantage in assuming a form like that of Eq. (5.5.8) for W Q . Let 



A 

0 

I ' 



0 

0 

( A- 3 ) 

A = 

A o 

. -n 2 

0 . 
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A 1 * 

. 0 

-A . 


(A. 4) W 0 = j U + V + eW + ... 

Substituting (A.2-A.4) into (A.l) yields 

(A. 5) ^[UA 0 + Aj U] + [VA 0 + aJv + UA X + a[u + C T C] 

+ e[WA Q + aJw + VA X + a|v] + ... = 0 

Set the coefficients of like powers in e equal to zero to obtain 

(A. 6) UA q + aJu = 0 
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(A.7) 


VAq + aJ V + UAj + aJu + C T C = 0 
(A. 8) WA q + aJ W + VAj + A{V = 0 

etc. Writing (A. 6) in partitioned form leads to 
(A. 9) u 12 q 2 + ft 2 u{ 2 = 0 

(A. 10) u n - Q 2 u 22 = 0 ; u n - u 22 fl 2 = 0 

(A. 11) u^ 2 + u^ 2 = 0 

2 

It follows from (A.9-A.11) and the assumption that the frequencies in ft are 
distinct that 

(A. 12) u 12 = 0 

(A. 13) u llf u 22 are diagonal 

Thus, 

Q 2 u 22 0 

(A. 14) U = 

0 u 22 

To determine the elements of the diagonal matrix u 22 , write (A.7) in partitioned 
form 

(A. 15) Vj 2 + Vj 2 - A u 22 - u 22 A + C y C v = 0 

(A. 16) -ft v 22 + + C p C v = 0 ; - v 22 ft + CyC p = 0 

(A. 17) -ft 2 v{ 2 - v 12 ft 2 + Cjc p = 0 
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The diagonal elements of v 12 can be easily found from (A. 17) 


(A. 18) 


< v 12>11 * 


( C P C P^1 

2 u>? 


Using (A. 18) and considering the diagonal terms in (A. 15) one may derive ex- 
plicit expressions for the diagonal elements of u 22 . 


(A. 19) 


(u 




2 


22'i i 


2 w‘ 


A ii 


Equations (A. 14) and (A. 19) define the dominant first term in the expansion for 
W Q . To determine the second term in (A. 4), we proceed to examine the off 
diagonal terms in (A.15)-(A.17). This leads to 


(A. 20) 


^ v 12^i j 


(fopVl * u X e v>t.1 ~ “i ( * u 22 + u 22*>1.1 . 


7 2 

Wj - u>. 


i * J 


(A. 21) 


(v U^ij 


. M fop C v*U " w i^ C P C v^ji 


7 2 

W 1 - w j 


II 


i * J 


(A. 22) 


(v 22^ij 


(C T C - (C T C 
v P v ; ij v P v y ji 


w 


w 


j 


i * J 


Equations (A. 18) and (A. 20) define v 12 completely, and (A. 21) and (A. 22) define 
the off diagonal terms of and v 22 . To find the diagonal terms of and 
v 22 , Eq. (A. 8) must be written in partitioned form. 

(A. 23) w 12 + wj 2 - Av 22 - v 22 A = 0 

2 2 T 

(A. 24) -fl w 22 + - v 12 A = 0 ; w n ~ w 22 Q - Av J2 = 0 
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(A. 25) 


-fl 2 w T 
u w 12 


- w 12 fl 


= 0 


From (A. 25) we find that 


(A. 26) 


( w l 2 )ii = 0 


Setting the diagonal elements of (A. 23) equal to zero and using (A. 26) yields 


n 

(A.27) (v 22 ) n - jj- 51 t 4 k1 (v 22>k1 + < v 22>1k 4 ki • 

1.-1 


k=l 

k*i 


m 

■4 £ 


k=l 

k*i 


^ C p C v\i ' ^ C p C v^ik 


«? 


- w. 


Then, considering the diagonal terms in (A. 16), 


(A. 28) 


(v ll^ii = ^ v 22^1 i " < C p C v> 


Thus Eqs. (A. 14, A. 19, A.20-A.22, A.27, A. 28) represent the first two terms in 
the expansion for W Q . 
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